For: α-Amylase Mutants

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, DC 20231 

Sir: 

Before the above-captioned application is taken up for examination, entry of 
the following amendment is respectfully requested: 

IN THE SPECIFICATION: 

At page 1, before the first line, insert 
--CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application serial no. 08/600,908 filed 
February 13, 1996 (now allowed), which is a continuation of PCT/DK96/00057 filed February 
5, 1996, and claims priority under 35 U.S.C. 119 of Danish applications 1256/95, 1192/95, 
and 0128/95 filed November 10, 1995, October 23, 1995, and February 3, 1995, respectively, 
the contents of which are fully incorporated herein by reference.--

On page 5, line 9, after "WO 95/26397", insert --(SEQ ID NO: 1 of WO
95/26397 is shown in Fig. 8 of the instant application (SEQ ID NO: 12))--.

On page 8, line 28, delete "the Figures" and insert --Figures 1-6--.

On page 43, lines 2 and 9 (both occurrences), delete "SEQ ID NO: 1" and
insert --SEQ ID NO: 2--.

On page 65, at the end of line 3, please insert --(SEQ ID NO: 7)--.

On page 65, at the end of line 6, please insert --(SEQ ID NO: 8)--.

On page 65, at the end of line 8, please insert --(SEQ ID NO: 9)--.

At page 65, line 27, after "(Novo Nordisk A/S)", insert --SEQ ID NO:2--. 

At page 67, line 21, after "WO 95/26397", insert --(SEQ ID NO: 1 of WO
95/26397 is shown in Fig. 8 of the instant application, SEQ ID NO: 12)--.

Please delete pages 69-75 (Sequence Listing), insert the substitute Sequence 
Listing appended herewith (marked as pages 69-89), and re-number the subsequent pages 
accordingly. 

IN THE CLAIMS: 

Cancel claims 1-70 without prejudice or disclaimer. 
Add new claims 71 and 72 reading as follows: 

--71. A method of producing a variant of a parent alpha-amylase having an
altered property relative to the parent, wherein the parent alpha-amylase has the sequence of 
SEQ ID Nos: 2, 4, 6, or 13, or has a sequence at least 70 % homologous to the sequence of 
SEQ ID Nos: 2, 4, 6, or 13 when homology is determined by the GAP program (Genetic 
Computer Group, Version 7.0) using default values for GAP penalties, said method 
comprising 

(a) modelling the parent alpha-amylase on an X-ray crystallographic three- 
dimensional structure of an alpha-amylase having the sequence of SEQ ID Nos: 2, 4, 6, or 
13, or having a sequence at least 70 % homologous to the sequence of SEQ ID Nos: 2, 4, 6, 
or 13 when homology is determined by the GAP program using default values for GAP 
penalties, to produce a three-dimensional structure of the parent alpha-amylase; 

(b) identifying in the three-dimensional structure obtained in step (a) at least 
one structural part of the parent wherein an alteration in said structural part is predicted to 
result in said altered property; 

(c) modifying the sequence of a nucleic acid encoding the parent alpha- 
amylase to produce a nucleic acid encoding a deletion, insertion, or substitution of one or 
more amino acids at a position corresponding to said structural part; and 

(d) expressing the modified nucleic acid to produce the variant alpha-amylase, 
wherein the variant has alpha-amylase enzymatic activity and has at least one altered property 
relative to the parent.--

--72. A method of constructing a variant of a parent alpha-amylase having an
altered property relative to the parent, wherein the parent alpha-amylase has the sequence of
SEQ ID Nos: 2, 4, 6, or 13 or has a sequence at least 70% homologous to the sequence of 
SEQ ID Nos: 2, 4, 6, or 13 when homology is determined by the GAP program (Genetic 
Computer Group, Version 7.0) using default values for GAP penalties, said method 
comprising 

(a) modelling the parent alpha-amylase on an X-ray crystallographic three- 
dimensional structure of an alpha-amylase having the sequence of SEQ ID Nos: 2, 4, 6, or 
13, or having a sequence at least 70% homologous to the sequence of SEQ ID Nos: 2, 4, 6, 
or 13 when homology is determined by the GAP program using default values for GAP 
penalties, to produce a three-dimensional structure of the parent alpha-amylase; 

(b) comparing the three-dimensional structure obtained in step (a) with a three- 
dimensional structure of an unrelated alpha-amylase, wherein the unrelated alpha-amylase 
differs from the parent alpha-amylase in said property; 

(c) identifying a structural part of the three-dimensional structure obtained in 
step (a) which is different from the three-dimensional structure of the unrelated alpha-amylase 
and which is predicted to be relevant to said property, 

(d) modifying the sequence of a nucleic acid encoding the parent alpha- 
amylase to produce a nucleic acid encoding a deletion, insertion, or substitution of one or 
more amino acids at a position corresponding to said structural part; and 

(e) expressing the modified nucleic acid to produce the variant alpha-amylase, 
wherein the variant has alpha-amylase activity and has one or more altered properties as 
compared to the parent alpha-amylase.--

REMARKS 

Entry of this amendment is respectfully requested. 

In this response, claims 1-70 are cancelled without prejudice and new claims 
71 and 72 are added. Support for the new claims can be found in the specification and 
original claims. For example, the definition of a parent (Termamyl-like) alpha-amylase can 
be found in the specification at page 5, lines 4-20. The process of modelling a parent alpha- 
amylase on a three-dimensional structure is described on page 12, lines 7-24; and on page 62, 
line 4 - page 63, line 5 (Example 1). Production of a variant alpha-amylase by modification 
of a nucleic acid sequence and expression of the modified sequence is disclosed on page 13,
lines 12-20. Alpha-amylases unrelated to the parent alpha-amylases as defined in the present 
claims are described at page 15, lines 11-17. No new matter is added. Accordingly, claims 
71 and 72 are pending and at issue. 

Sequence Listing 

Appended herewith is a request to transfer the Sequence Listing from the parent 
application to this application. 

The content of the attached paper entitled "SEQUENCE LISTING" is identical 
to the information in the specification as originally filed. No new matter is added. 

Rejection of Claims Under 35 U.S.C. § 103(a) 

In the parent application, claims similar in scope to present claims 71 and 72
were rejected under 35 U.S.C. § 103(a) as unpatentable over Machius et al., J. Mol. Biol.
246:545, 1995, and MacGregor, J. Prot. Chem. 7, 399, 1988, further in view of Svendsen et
al., WO 94/02597 (see Office Action of December 8, 1998, in application serial no.
08/600,908, now allowed). The Examiner contended that Machius et al. discloses the
structure of calcium-depleted α-amylase from B. licheniformis; MacGregor teaches how to
predict α-amylase structure; Svendsen et al. teaches recombinant production of α-amylases;
and that it would have been obvious to combine the teachings in these references to
achieve the claimed invention. The Examiner also contended that Applicants were only
entitled to the priority date of October 23, 1995, for the three-dimensional structure disclosed
in the Appendix of the present application. This rejection is respectfully traversed.

Attached herewith is a certified copy of the first priority application, DK 
0128/95, which was filed on February 3, 1995. The Examiner's attention is directed to the 
Appendix (following the figures), showing that the coordinates describing the
three-dimensional structure of a Bacillus α-amylase were disclosed in this application. It is believed
on this basis that Applicants are entitled to the priority date of February 3, 1995 for the 
presently claimed invention. 

On this basis, it is respectfully submitted that Machius et al., which was
published in March, 1995, is not a reference against the present claims.



As discussed with the Examiner in an interview with Applicants' representative
and the inventors on May 27, 1999, MacGregor provides almost no useful information
regarding the structure of Bacillus α-amylases beyond elucidating the central β-barrel structure
and the structure of some of the surrounding α-helices. Thus, one of ordinary skill in the art,
based on MacGregor, with or without Svendsen et al., could have no reasonable expectation
of achieving α-amylase variants with predictively altered properties.

Accordingly, it is respectfully submitted that present claims 71 and 72 are free 
of the prior art. 

Furthermore, and as further discussed at the May 27, 1999, interview,
Applicants believe that the striking structural homologies between α-amylases belonging to
the "Termamyl-like" α-amylase family (i.e., α-amylases having the sequence of SEQ ID Nos:
2, 4, 6, or 13 of the present specification or having a sequence at least 70% homologous to
the sequence of SEQ ID Nos: 2, 4, 6, or 13, as required by the present claims) fully support
the scope of the present claims.

In view of the above amendments and remarks, it is believed that the claims 
are in condition for allowance, and a determination to that effect is earnestly solicited. 

Respectfully submitted, 



Date: June 3, 1999 



Icza Green, Reg. No. 38,475 
Novo Nordisk of North America, Inc. 
405 Lexington Avenue, Suite 6400 
New York, NY 10174-6401 
(212) 867-0123 
α-AMYLASE MUTANTS

FIELD OF THE INVENTION 

The present invention relates to a novel method of designing α-amylase
mutants with predetermined properties, which method is based on the hitherto unknown
three-dimensional structure of bacterial α-amylases.

BACKGROUND OF THE INVENTION 

α-Amylases (α-1,4-glucan-4-glucanohydrolase, EC 3.2.1.1) constitute a group
of enzymes which is capable of hydrolyzing starch and other linear and branched
1,4-glucosidic oligo- and polysaccharides. Almost all α-amylases studied have a few
conserved regions with approximately the same length and spacing. One of these regions
resembles the Ca2+ binding site of calmodulin and the others are thought to be necessary for
the active center and/or binding of the substrate.

While the amino acid sequence and thus primary structure of a large number
of α-amylases are known, it has proved very difficult to determine the three-dimensional
structure of all α-amylases. The three-dimensional structure can be determined by X-ray
crystallographic analysis of α-amylase crystals, but it has proven difficult to obtain α-amylase
crystals suitable for actually solving the structure.

Until now the three-dimensional structures of only a few α-amylases have been
determined at high resolution. These include the structure of the Aspergillus oryzae TAKA
α-amylase (Swift et al., 1991), the Aspergillus niger acid amylase (Brady et al., 1991), the
structure of pig pancreatic α-amylase (Qian et al., 1993), and the barley alpha-amylase
(Kadziola et al. 1994, Journal of Molecular Biology 239: 104-121, A. Kadziola, Thesis, Dept.
of Chemistry, U. of Copenhagen, Denmark). Furthermore, the three-dimensional structure
of a Bacillus circulans cyclodextrin glycosyltransferase (CGTase) is known (Klein et al.,
1992) (Lawson et al., 1994). The CGTase catalyzes the same type of reactions as
α-amylases and exhibits some structural resemblance with α-amylases.

Furthermore, crystallization and preliminary X-ray studies of B. subtilis
α-amylases have been described (Chang et al. (1992) and Mizuno et al. (1993)). No final
B. subtilis structure has been reported. Analogously, the preparation of B. licheniformis
α-amylase crystals has been reported (Suzuki et al. (1990)), but no subsequent report on
X-ray crystallographic analysis or three-dimensional structure is available.

Several research teams have attempted to build three-dimensional structures
on the basis of the above known α-amylase structures. For instance, Vihinen et al. (J.
Biochem. 107, 267-272, 1990), disclose the modelling (or computer simulation) of a
three-dimensional structure of the Bacillus stearothermophilus α-amylase on the basis of the
TAKA amylase structure. The model was used to investigate hypothetical structural
consequences of various site-directed mutations of the B. stearothermophilus α-amylase.
E. A. MacGregor (1987) predicts the presence of α-helices and β-barrels in α-amylases from
different sources, including barley, pig pancreas and Bacillus amyloliquefaciens on the basis
of the known structure of the A. oryzae TAKA α-amylase and secondary structure predicting
algorithms. Furthermore, the possible loops and subsites which may be found to be present
in, e.g., the B. amyloliquefaciens α-amylase are predicted (based on a comparison with the
A. oryzae sequence and structure).

E. A. MacGregor (Starch/Stärke 45 (1993), No. 7, p. 232-237) presents a
review of the relationship between the structure and activity of α-amylase-related enzymes.

Hitherto, no three-dimensional structure has been available for the industrially
important Bacillus α-amylases (which in the present context are termed "Termamyl-like
α-amylases"), including the B. licheniformis, the B. amyloliquefaciens, and the B.
stearothermophilus α-amylase.

BRIEF DISCLOSURE OF THE INVENTION 

The three-dimensional structure of a Termamyl-like bacterial α-amylase has
now been elucidated. On the basis of an analysis of said structure it is possible to identify
structural parts or specific amino acid residues which from structural or functional
considerations appear to be important for conferring the various properties to the
Termamyl-like α-amylases. Furthermore, when comparing the Termamyl-like α-amylase structure with
known structures of the fungal and mammalian α-amylases mentioned above, it has been
found that some similarities exist between the structures, but also that some striking, and not
previously predicted, structural differences between the α-amylases exist. The present
invention is based on these findings.



Accordingly, in a first aspect the invention relates to a method of constructing
a variant of a parent Termamyl-like α-amylase, which variant has α-amylase activity and at
least one altered property as compared to said parent α-amylase, which method comprises

i) analyzing the structure of the Termamyl-like α-amylase with a view to
identifying at least one amino acid residue or at least one structural part of the Termamyl-like
α-amylase structure, which amino acid residue or structural part is believed to be of
relevance for altering said property of the parent Termamyl-like α-amylase (as evaluated on
the basis of structural or functional considerations),

ii) constructing a Termamyl-like α-amylase variant, which as compared to the
parent Termamyl-like α-amylase, has been modified in the amino acid residue or structural
part identified in i) so as to alter said property, and

iii) testing the resulting Termamyl-like α-amylase variant for said property.

In a second aspect the present invention relates to a method of constructing a
variant of a parent Termamyl-like α-amylase, which variant has α-amylase activity and one
or more altered properties as compared to said parent α-amylase, which method comprises

i) comparing the three-dimensional structure of the Termamyl-like α-amylase
with the structure of a non-Termamyl-like α-amylase,

ii) identifying a part of the Termamyl-like α-amylase structure which is
different from the non-Termamyl-like α-amylase structure, and

iii) modifying the part of the Termamyl-like α-amylase identified in ii)
whereby a Termamyl-like α-amylase variant is obtained, one or more properties of which
differ from the parent Termamyl-like α-amylase.

In a third aspect the invention relates to a method of constructing a variant of
a parent non-Termamyl-like α-amylase, which variant has α-amylase activity and one or
more altered properties as compared to said parent α-amylase, which method comprises

i) comparing the three-dimensional structure of the non-Termamyl-like
α-amylase with the structure of a Termamyl-like α-amylase,

ii) identifying a part of the non-Termamyl-like α-amylase structure which is
different from the Termamyl-like α-amylase structure, and

iii) modifying the part of the non-Termamyl-like α-amylase identified in ii)
whereby a non-Termamyl-like α-amylase variant is obtained, one or more properties of
which differ from the parent non-Termamyl-like α-amylase.



The property which may be altered by the above methods of the present
invention may, e.g., be substrate specificity, substrate binding, substrate cleavage pattern,
temperature stability, pH dependent activity, pH dependent stability (especially increased
stability at low (e.g. pH<6, in particular pH<5) or high (e.g. pH>9) pH values), stability
towards oxidation, Ca2+-dependency, specific activity, and other properties of interest. For
instance, the alteration may result in a variant which, as compared to the parent
Termamyl-like α-amylase, has an increased specific activity at a given pH and/or an altered substrate
specificity.

In still further aspects the invention relates to variants of a Termamyl-like
α-amylase, DNA encoding such variants and methods of preparing the variants. Finally, the
invention relates to the use of the variants for various industrial purposes.

DETAILED DISCLOSURE OF THE INVENTION 
The Termamyl-like α-amylase

It is well known that a number of alpha-amylases produced by Bacillus spp.
are highly homologous on the amino acid level. For instance, the B. licheniformis α-amylase
comprising the amino acid sequence shown in SEQ ID No. 2 (commercially available as
Termamyl®) has been found to be about 89% homologous with the B. amyloliquefaciens
α-amylase comprising the amino acid sequence shown in SEQ ID No. 4 and about 79%
homologous with the B. stearothermophilus α-amylase comprising the amino acid sequence
shown in SEQ ID No. 6. Further homologous α-amylases include an α-amylase derived
from a strain of the Bacillus sp. NCIB 12289, NCIB 12512, NCIB 12513 or DSM 9375, all
of which are described in detail in WO 95/26397, and the α-amylase described by Tsukamoto
et al., 1988, Biochemical and Biophysical Research Communications, Vol. 151, No. 1. Still
other homologous α-amylases include the α-amylase produced by the B. licheniformis
described in EP 252 666 (ATCC 27811), and the α-amylases identified in WO 91/00353 and
WO 94/18314. Other commercial Termamyl-like B. licheniformis α-amylases are Optitherm®
and Takatherm® (available from Solvay), Maxamyl® (available from
Gist-brocades/Genencor), Spezym AA® (available from Genencor), and Keistase® (available from
Daiwa).



Because of the substantial homology found between these α-amylases, they are
considered to belong to the same class of α-amylases, namely the class of "Termamyl-like
α-amylases".

Accordingly, in the present context, the term "Termamyl-like α-amylase" is
intended to indicate an α-amylase which, on the amino acid level, exhibits a substantial
homology to Termamyl®, i.e. the B. licheniformis α-amylase SEQ ID NO 2. In other words,
a Termamyl-like α-amylase is an α-amylase, which has the amino acid sequence shown in
SEQ ID No. 2, 4 or 6 herein, or the amino acid sequence shown in SEQ ID NO 1 or 2 of
WO 95/26397 or in Tsukamoto et al., 1988, or i) which displays at least 60%, such as at
least 70%, e.g. at least 75%, or at least 80%, e.g. at least 85%, at least 90% or at least 95%
homology with at least one of said amino acid sequences and/or ii) displays immunological
cross-reactivity with an antibody raised against at least one of said α-amylases, and/or iii)
is encoded by a DNA sequence which hybridizes to the DNA sequences encoding the above
specified α-amylases which are apparent from SEQ ID Nos. 1, 3 and 5 of the present
application, and SEQ ID NO 4 and 5 of WO 95/26397, respectively.

In connection with property i) the "homology" may be determined by use of
any conventional algorithm, preferably by use of the GAP programme from the GCG package
version 7.3 (June 1993) using default values for GAP penalties (Genetic Computer Group
(1991) Programme Manual for the GCG Package, version 7, 575 Science Drive, Madison,
Wisconsin, USA 53711).
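
By way of non-limiting illustration, the kind of pairwise comparison that underlies a percentage homology figure may be sketched as follows. The sketch is not the GAP programme of the GCG package: it is a minimal global (Needleman-Wunsch type) alignment with a linear gap penalty, and the scoring values, the function names and the short example strings are assumptions introduced here for illustration only.

```python
def global_align(a, b, match=1, mismatch=-1, gap=-2):
    """Global alignment of two sequences with a linear gap penalty.

    The scoring values are illustrative assumptions, not GAP defaults.
    Returns the two aligned strings, with '-' marking gaps.
    """
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    i, j = n, m
    out_a, out_b = [], []
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Identical aligned positions as a percentage of the alignment length."""
    al_a, al_b = global_align(a, b)
    if not al_a:
        return 0.0
    same = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * same / len(al_a)
```

Under these assumptions, a candidate sequence could be compared in turn with each reference sequence and the resulting percentage set against the chosen threshold (e.g. 60%). Figures obtained with the GAP programme and its default penalties may differ from those of this sketch.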

Property ii) of the α-amylase, i.e. the immunological cross reactivity, may be
assayed using an antibody raised against or reactive with at least one epitope of the relevant
Termamyl-like α-amylase. The antibody, which may either be monoclonal or polyclonal,
may be produced by methods known in the art, e.g. as described by Hudson et al., 1989.
The immunological cross-reactivity may be determined using assays known in the art,
examples of which are Western Blotting or radial immunodiffusion assay, e.g. as described
by Hudson et al., 1989. In this respect, immunological cross-reactivity between the
α-amylases having the amino acid sequences SEQ ID Nos. 2, 4 and 6, respectively, has been
found.

The oligonucleotide probe used in the characterization of the Termamyl-like
α-amylase in accordance with property iii) above may suitably be prepared on the basis of
the full or partial nucleotide or amino acid sequence of the α-amylase in question. Suitable
conditions for testing hybridization involve presoaking in 5xSSC and prehybridizing for 1 h
at ~40°C in a solution of 20% formamide, 5xDenhardt's solution, 50 mM sodium
phosphate, pH 6.8, and 50 μg of denatured sonicated calf thymus DNA, followed by
hybridization in the same solution supplemented with 100 μM ATP for 18 h at ~40°C, or
other methods described by e.g. Sambrook et al., 1989.

In the present context, "derived from" is intended not only to indicate an
α-amylase produced or producible by a strain of the organism in question, but also an
α-amylase encoded by a DNA sequence isolated from such strain and produced in a host
organism transformed with said DNA sequence. Finally, the term is intended to indicate an
α-amylase which is encoded by a DNA sequence of synthetic and/or cDNA origin and which
has the identifying characteristics of the α-amylase in question. The term is also intended
to indicate that the parent α-amylase may be a variant of a naturally occurring α-amylase,
i.e. a variant which is the result of a modification (insertion, substitution, deletion) of one
or more amino acid residues of the naturally occurring α-amylase.

Parent hybrid α-amylases

The parent α-amylase (being a Termamyl-like or non-Termamyl-like
α-amylase) may be a hybrid α-amylase, i.e. an α-amylase which comprises a combination of
partial amino acid sequences derived from at least two α-amylases.

The parent hybrid α-amylase may be one which on the basis of amino acid
homology and/or immunological cross-reactivity and/or DNA hybridization (as defined
above) can be determined to belong to the Termamyl-like α-amylase family. In this case,
the hybrid α-amylase is typically composed of at least one part of a Termamyl-like α-amylase
and part(s) of one or more other α-amylases selected from Termamyl-like α-amylases or
non-Termamyl-like α-amylases of microbial (bacterial or fungal) and/or mammalian origin.

Thus, the parent hybrid α-amylase may comprise a combination of at least two
Termamyl-like α-amylases, or of at least one Termamyl-like and at least one
non-Termamyl-like bacterial α-amylase, or of at least one Termamyl-like and at least one fungal α-amylase.
For instance, the parent α-amylase comprises a C-terminal part of an α-amylase derived from
a strain of B. licheniformis and an N-terminal part of an α-amylase derived from a strain of
B. amyloliquefaciens or from a strain of B. stearothermophilus. For instance, the parent
α-amylase comprises at least 430 amino acid residues of the C-terminal part of the B.
licheniformis α-amylase, and may, e.g. comprise a) an amino acid segment corresponding
to the 37 N-terminal amino acid residues of the B. amyloliquefaciens α-amylase having the
amino acid sequence shown in SEQ ID No. 4 and an amino acid segment corresponding to
the 445 C-terminal amino acid residues of the B. licheniformis α-amylase having the amino
acid sequence shown in SEQ ID No. 2, or b) an amino acid segment corresponding to the
68 N-terminal amino acid residues of the B. stearothermophilus α-amylase having the amino
acid sequence shown in SEQ ID No. 6 and an amino acid segment corresponding to the 415
C-terminal amino acid residues of the B. licheniformis α-amylase having the amino acid
sequence shown in SEQ ID No. 2.

Analogously, the parent hybrid α-amylase may belong to a non-Termamyl-like
α-amylase family, e.g. the Fungamyl-like α-amylase family. In that case the hybrid may
comprise at least one part of an α-amylase belonging to the non-Termamyl-like α-amylase
family in combination with one or more parts derived from other α-amylases.

The three-dimensional Termamyl-like α-amylase structure

The Termamyl-like α-amylase which was used to elucidate the
three-dimensional structure forming the basis for the present invention consists of the 300
N-terminal amino acids of the B. amyloliquefaciens α-amylase (with the amino acid sequence
shown in SEQ ID No. 4) and amino acids 301-483 of the C-terminal end of the B.
licheniformis α-amylase with the amino acid sequence SEQ ID No. 2. The bacterial
α-amylase belongs to the "Termamyl-like α-amylase family" and the present structure is
believed to be representative for the structure of any Termamyl-like α-amylase.
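
The composition of such a hybrid may be illustrated, purely as a sketch, by joining an N-terminal segment of one sequence to a C-terminal segment of another using 1-based residue numbering. The function name and the placeholder strings used in any call are hypothetical; the actual sequences are those of SEQ ID Nos. 4 and 2 and are not reproduced here.

```python
def make_hybrid(n_donor, c_donor, n_count, c_first, c_last):
    """Join residues 1..n_count of n_donor to residues c_first..c_last of c_donor.

    Residue numbers are 1-based and inclusive, as in the text.
    """
    if not 0 <= n_count <= len(n_donor):
        raise ValueError("n_count lies outside the N-terminal donor sequence")
    if not 1 <= c_first <= c_last <= len(c_donor):
        raise ValueError("C-terminal segment lies outside the donor sequence")
    return n_donor[:n_count] + c_donor[c_first - 1:c_last]
```

For the hybrid described above, a call of the form make_hybrid(seq_no_4, seq_no_2, 300, 301, 483) would apply, where seq_no_4 and seq_no_2 are placeholders for the two parent sequences; the result has 300 + 183 = 483 residues.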

The structure of the α-amylase was solved in accordance with the principle for
X-ray crystallographic methods given in "X-Ray Structure Determination", Stout, G.H. and
Jensen, L.H., John Wiley & Sons, Inc., NY, 1989. The structural coordinates for the solved
crystal structure of the α-amylase at 2.2 Å resolution using the isomorphous replacement
method are given in a standard PDB format (Brookhaven Protein Data Base) in Appendix 1.
It is to be understood that Appendix 1 forms part of the present application.
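
Coordinate records of this kind may be read programmatically. The following sketch assumes the fixed-column ATOM/HETATM layout of the present-day PDB format (atom name in columns 13-16, residue name in columns 18-20, residue number in columns 23-26, and x, y, z in columns 31-54). Whether every record of Appendix 1 follows exactly this layout is an assumption, and the function name is hypothetical.

```python
def parse_pdb_atom(line):
    """Parse one ATOM or HETATM record; return None for any other record."""
    if not line.startswith(("ATOM", "HETATM")):
        return None
    return {
        "record": line[0:6].strip(),
        "atom": line[12:16].strip(),       # e.g. CA, OD1, OD2
        "res_name": line[17:20].strip(),   # three-letter residue code
        "res_num": int(line[22:26]),       # residue sequence number
        "xyz": (float(line[30:38]),        # orthogonal coordinates in Å
                float(line[38:46]),
                float(line[46:54])),
    }
```

Each parsed record yields the residue name, residue number and atom position, which is the information needed for the distance-based analyses discussed further below.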

Amino acid residues of the enzyme are identified by three-letter amino acid
code (capitalized letters).

The α-amylase structure is made up of three globular domains ordered A, B,
and C with respect to sequence, which lie approximately along a line in the order B, A, C.
The domains can be defined as being residues 1-103 and 206-395 for domain A, residues
104-205 for domain B, and residues 396-483 for domain C, the numbers referring to the B.
licheniformis α-amylase. This gives rise to an elongated molecule, the longest axis being
about 85 Å. The widest point perpendicular to this axis is approximately 50 Å and spans the
central A domain. The active site residues of the B. licheniformis α-amylase (SEQ ID NO
2) are D323, D231 and E261.
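
The domain definition given above lends itself to a simple lookup. The following sketch encodes the residue ranges exactly as stated in the preceding paragraph (B. licheniformis numbering); the names DOMAIN_RANGES and domain_of are hypothetical and introduced for illustration only.

```python
# Residue ranges per domain, as stated in the text (1-based, inclusive).
DOMAIN_RANGES = {
    "A": [(1, 103), (206, 395)],
    "B": [(104, 205)],
    "C": [(396, 483)],
}


def domain_of(residue_number):
    """Return 'A', 'B' or 'C' for a B. licheniformis residue number."""
    for domain, ranges in DOMAIN_RANGES.items():
        for first, last in ranges:
            if first <= residue_number <= last:
                return domain
    raise ValueError("residue number outside 1-483")
```

With these ranges, the three active site residues D323, D231 and E261 all fall within domain A.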

Domain A 

Domain A is the largest domain and contains the active site (comprised of a
cluster of three amino acid residues placed at the bottom of a deep cleft in the enzyme's
surface). Domain A of all known α-amylase structures has the same overall fold, viz. the
(beta/alpha)8 barrel with 8 central beta strands (numbered 1-8) and 8 flanking α-helices. The
β-barrel is defined by MacGregor op. cit. The C-terminal end of beta strand 1 is connected
to helix 1 by a loop denoted loop 1 and an identical pattern is found for the other loops.
These loops show some variation in size and some can be quite extensive.

The 8 central beta-strands in the (beta/alpha)8 barrel superimpose well
between the various known α-amylase structures, and this part of the structure, including the
close surroundings of the active site located at the C-terminal end of the beta-strands, shows
high similarity between the different amylases.

The loops connecting beta-strands and alpha helices display high variations
between alpha amylases. These loops constitute the structural context of the active site and
the majority of the contacts to the substrate are found among residues located in these loops.
Such important characteristics as substrate specificity, substrate binding, pH/activity profile,
and starch cleavage pattern are determined by the amino acids and the positions of same in these
loops.

The substantial differences between the Fungamyl-like α-amylase structure and
the structure of the Termamyl-like α-amylase disclosed herein which are found in loops 1,
2, 3, and 8 are visualized in the Figures.

Domain B 

The Termamyl-like α-amylase structure has been found to comprise a special
domain structure in the A domain's loop 3, also called domain B. The structure of the
Termamyl-like α-amylase B domain has never been seen before in any of the known
α-amylase or (β/alpha)8-barrel proteins.

The domain B structure is a very compact domain having a very high number
of charged residues. The B domain arises as an extension of the loop between strand 3 and
helix 3 of domain A (shown in Fig. 7) and contains a 5 stranded antiparallel β-sheet structure
containing at least one long loop structure and having the connectivity -1, +3, -1X, +2
(Richardson, 1981, Adv. Protein Chem. 34, 167-339).

The first four strands of the B domain form two hairpin loops which twist
around each other like a pair of crossed fingers (right-hand twist). The main chain folds into
a β-strand which connects two small β-sheet structures. After making one turn in one sheet
it folds back and makes up a two stranded sheet in contact with domain A and an internal
hole in the α-amylase structure. Then the main chain folds up to a small sheet structure
nearly perpendicular to the first two sheets. Before entering the helix 3 on top of the
β-strand 3, the approximately 24 last amino acids in domain B form two calcium binding
sites in the contact region to domain A.

Domain B is connected with domain A by two peptide stretches, which divide
the domain-domain contact areas into two. Domain B is in contact with Domain A by a
calcium binding region and an internally buried hole containing waters. Many types of
molecular contacts are present. Ionic interactions between acidic and basic amino acids are
possible; these interactions are very important for the general stability at high pH and for
keeping the calcium binding sites intact.

Domain C 

Domain C is the C-terminal part of the protein consisting of amino acids
394-483. Domain C is composed entirely of β-strands which form a single 8-stranded sheet
structure, which folds back on itself, and thus may be described as a β-sandwich structure.
The connectivity is +1, +1, +5, -3, +1, +1, -3 although strands 6 and 7 are only loosely
connected. One part of the β-sheet forms the interface to domain A.

Ca-binding and Na-binding sites 

The structure of the Termamyl-like α-amylase is remarkable in that it exhibits
four calcium-binding sites and one sodium-binding site. In other words four calcium ions
and one sodium ion are found to be present in the structure, although one of the calcium ions
displays very weak coordination. Two of the calcium ions form part of a linear cluster of
three ions, the central ion being attributed to sodium, which lie at the junction of the A and
B domains.

The coordinating residues for the calcium ions between the A and B domain
are as follows (using the Pdb file nomenclature for amino acid residues and atoms in the Pdb
file found in Appendix 1 herein): For the calcium ion nearest to the active site (IUM 502 in
the pdb file), the backbone carbonyls from His235 and Asp194, the sidechain atom OD1
from residues Asp194, Asn102 and Asp200, and one water molecule WAT X3 (atom OW7).
For the sodium ion (IUM 505), the binding site includes atom OD2 from Asp194, Asp200,
Asp183 and Asp159, and a backbone carbonyl from Val201. The coordinates for the other
calcium ion between domain A and B are (IUM 501): atom OD2 from Asp204 and Asp159,
backbone carbonyl from Asp183 and Ala181, atom OD1 from Asp202, and one water
molecule WAT X7 (atom OW7).

One calcium ion is located between the A and C domains; another is located
in the C domain. The first-mentioned calcium ion, which is also the best coordinated one
(IUM 503), is coordinated by backbone carbonyls from Gly300, Tyr302 and His406, atom OD2/OD1
from Asp430, atom OD1 from Asp407, and one water molecule WAT X6 (atom OW7). The
other, very weakly coordinated calcium site (IUM 504) comprises 4 water molecules,
WAT X21 (atom OW8), X6 (atom OW6), X9 (atom OW0) and X28 (atom OW8), OE1/OE2
from Glu447 and OD1 from Asn444.



Substrate-binding site 

Without being limited to any theory, it is presently believed that favorable
interactions between a substrate molecule and the enzyme (such as hydrogen bonds and/or
strong electrostatic interaction) are found within a sphere of 4 Å of the substrate, when bound
to the enzyme. The following residues of the B. licheniformis α-amylase having the amino
acid sequence shown in SEQ ID No. 2 are contemplated to be within a distance of 4 Å of
the substrate and thus believed to be involved in interactions with the substrate:
Trp13, Tyr14, Asn17, Asp18, Ser50, Gln51, Ala52, Asp53, Val54, Gly55, Tyr56, Lys70,
Arg74, Lys76, Val102, His105, Gly107, Gly108, Ala109, Trp138, Thr163, Asp164, Trp165,
Asn172, Glu189, Tyr193, Leu196, Met197, Tyr198, Ala199, Arg229, Asp231, Ala232,
Lys234, His235, Glu261, Trp263, His327, Asp328, Gln333, Ser334, and Leu335.
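
The distance criterion used above can be sketched as a simple cutoff search. The following Python fragment is a minimal illustration only, assuming that atom coordinates in Å are available as plain tuples; the function name `residues_within` and the toy coordinates are hypothetical and are not taken from Appendix 1.

```python
import math

def residues_within(protein_atoms, substrate_atoms, cutoff=4.0):
    """Return sorted residue labels having any atom within `cutoff` Å of any substrate atom.

    protein_atoms: iterable of (residue_label, (x, y, z)) pairs.
    substrate_atoms: iterable of (x, y, z) tuples.
    """
    close = set()
    for label, protein_xyz in protein_atoms:
        if any(math.dist(protein_xyz, sub_xyz) <= cutoff for sub_xyz in substrate_atoms):
            close.add(label)
    return sorted(close)

# Hypothetical toy coordinates (Å), for illustration only.
protein_atoms = [
    ("Trp13", (0.0, 0.0, 0.0)),
    ("Tyr14", (3.0, 0.0, 0.0)),
    ("Gly300", (20.0, 0.0, 0.0)),
]
substrate_atoms = [(1.0, 1.0, 0.0), (5.0, 0.0, 0.0)]

near = residues_within(protein_atoms, substrate_atoms)
print(near)  # ['Trp13', 'Tyr14']
```

The same search with a different cutoff and a different set of reference atoms would apply to any other distance-based selection of residues.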

The amino acid residues of another Termamyl-like α-amylase, which are
contemplated to be within a distance of 4 Å of the substrate, may easily be identified by
aligning the amino acid sequence SEQ ID NO 2 with that of the other Termamyl-like
α-amylase and thereby identifying the positions equivalent to those identified above.
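
The identification of equivalent positions from an alignment can be sketched as follows. This is a minimal illustration in Python, assuming the two sequences have already been aligned by some external tool and are supplied as gapped rows of equal length; the function name `equivalent_positions` and the short example rows are hypothetical and are not taken from SEQ ID NO 2 or any other sequence of this document.

```python
def equivalent_positions(ref_aligned, other_aligned):
    """Map 1-based positions of the reference sequence to 1-based positions
    of the other sequence, using two rows of one alignment ('-' marks a gap).
    Reference positions aligned to a gap have no equivalent and are omitted."""
    if len(ref_aligned) != len(other_aligned):
        raise ValueError("aligned rows must have the same length")
    mapping = {}
    ref_pos = other_pos = 0
    for ref_res, other_res in zip(ref_aligned, other_aligned):
        if ref_res != "-":
            ref_pos += 1
        if other_res != "-":
            other_pos += 1
        if ref_res != "-" and other_res != "-":
            mapping[ref_pos] = other_pos
    return mapping

# Hypothetical aligned rows, for illustration only.
ref_row = "ANLNG-TLMQ"
other_row = "VN--GWTMMQ"

mapping = equivalent_positions(ref_row, other_row)
print(mapping)  # {1: 1, 2: 2, 5: 3, 6: 5, 7: 6, 8: 7, 9: 8}
```

A residue number identified in the reference sequence is then looked up in `mapping` to obtain the equivalent position in the other sequence, if one exists.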

Generality of structure 

Because of the high homology between the various Termamyl-like α-amylases,
the solved structure defined by the coordinates of Appendix 1 is believed to be representative
of the structure of all Termamyl-like α-amylases. A model structure of other Termamyl-like
α-amylases may easily be built on the basis of the coordinates given in Appendix 1 adapted
to the α-amylase in question by use of an alignment between the respective amino acid
sequences. The creation of a model structure is exemplified in Example 1.

The above identified structurally characteristic parts of the Termamyl-like
α-amylase structure (Ca-binding site, substrate binding site, loops, etc.) may easily be identified
in other Termamyl-like α-amylases on the basis of a model (or solved) structure of the
relevant Termamyl-like α-amylase or simply on the basis of an alignment between the amino
acid sequence of the Termamyl-like α-amylase in question with that of the B. licheniformis
α-amylase used herein for identifying the amino acid residues of the respective structural
elements.

Furthermore, in connection with Termamyl-like variants of the invention,
which are defined by modification of specific amino acid residues of a specific Termamyl-like
α-amylase, it will be understood that variants of another Termamyl-like α-amylase
modified in an equivalent position (as determined from the best possible amino acid sequence
alignment between the respective sequences) are intended to be covered as well. Thus,
irrespective of whether an amino acid residue is identified herein for the purpose of defining
a structural part of a given α-amylase or used for identifying a variant of the α-amylase, this
amino acid residue shall be considered as representing the equivalent amino acid residue of
any other Termamyl-like α-amylase.



Methods of the invention for design of novel α-amylase variants

In the methods according to the first, second and third aspects of the invention
the terms "structure of a Termamyl-like α-amylase" and "Termamyl-like α-amylase
structure" are intended to indicate the solved structure defined by the coordinates presented
in Appendix 1 or a model structure of a given Termamyl-like α-amylase (such as the B.
licheniformis α-amylase) built on the basis of the solved structure.

In most cases the parent Termamyl-like α-amylase to be modified in
accordance with the present invention is different from the α-amylase which was actually
used for solving the structure (Appendix 1). This means that the amino acid residue(s) or
structural part(s) identified in the solved structure (Appendix 1) in step i) of the method
according to the first, second or third aspect of the invention must be translated into the
corresponding amino acid residue(s) or structural part(s) of the parent Termamyl-like
α-amylase in question. The "translation" is conveniently performed on the basis of an amino
acid sequence alignment between the amino acid sequence of the Termamyl-like α-amylase
used for solving the structure and the amino acid sequence of the parent Termamyl-like
α-amylase in question.

The analysis or comparison performed in step i) of the method according to
the first, second and third aspect, respectively, of the invention may be performed by use of
any suitable computer program capable of analyzing and/or comparing protein structures,
e.g. the computer program Insight, available from Biosym Technologies, Inc. For instance,
the basic principle of structure comparison is that the three-dimensional structures to be
compared are superimposed on the basis of an alignment of secondary structure elements
(such as the central 8 β-strands in the barrel) and the parts differing between the structures
can subsequently easily be identified from the superimposed structure.

The structural part which is identified in step i) of the methods of the first,
second and third aspects of the invention may be composed of one amino acid residue.
However, normally the structural part comprises more than one amino acid residue, typically
constituting one of the above parts of the Termamyl-like α-amylase structure such as one of
the A, B, or C domains, an interface between any of these domains, a calcium binding site,
a loop structure, the substrate binding site, or the like.

In the present context the term "structural or functional considerations" is
intended to indicate that modifications are made on the basis of an analysis of the relevant
structure or structural part and its contemplated impact on the function of the enzyme. Thus,
an analysis of the structures of the various α-amylases which until now have been elucidated,
optionally in combination with an analysis of the functional differences between these
α-amylases, may be used for assigning certain properties of the α-amylases to certain parts of
the α-amylase structure or to contemplate such relationship. For instance, differences in the
pattern or structure of loops surrounding the active site may result in differences in access
to the active site of the substrate and thus differences in substrate specificity and/or cleavage
pattern. Furthermore, parts of a Termamyl-like α-amylase involved in or contemplated to
be involved in substrate binding (and thus e.g. specificity/cleavage pattern), calcium or
sodium ion binding (e.g. of importance for the calcium-dependency of the enzyme), and the
like have been identified (vide infra).

The modification of an amino acid residue or structural part is typically 
accomplished by suitable modifications of a DNA sequence encoding the parent enzyme in 
question. The term "modified" as used in step ii) in the method according to the first aspect 
of the invention is intended to have the following meaning: When used in relation to an 
amino acid residue the term is intended to mean replacement of the amino acid residue in 
question with another amino acid residue. When used in relation to a structural part, the 
term is intended to mean replacement of one or more amino acid residues of said structural 
part, addition of one or more amino acid residues to said part, or deletion of one or more 
amino acid residues of said structural part. 
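
The three meanings of "modified" given above (replacement, addition and deletion) can be illustrated at the level of the amino acid sequence. The following Python sketch is illustrative only: it operates on a one-letter sequence string rather than on the encoding DNA, the helper names `substitute`, `insert_after` and `delete` are hypothetical, and the short parent sequence is invented for the example.

```python
def substitute(seq, pos, residue):
    """Replace the residue at 1-based position `pos` with `residue`."""
    return seq[:pos - 1] + residue + seq[pos:]

def insert_after(seq, pos, residues):
    """Add one or more residues immediately after 1-based position `pos`."""
    return seq[:pos] + residues + seq[pos:]

def delete(seq, start, end):
    """Delete residues `start`..`end` (1-based, inclusive)."""
    return seq[:start - 1] + seq[end:]

# Hypothetical 10-residue parent fragment, for illustration only.
parent = "MKQHKRLYAR"

print(substitute(parent, 3, "A"))     # MKAHKRLYAR
print(insert_after(parent, 3, "GG"))  # MKQGGHKRLYAR
print(delete(parent, 4, 6))           # MKQLYAR
```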

The construction of the variant of interest is accomplished by cultivating a 
microorganism comprising a DNA sequence encoding the variant under conditions which are 
conducive for producing the variant, and optionally subsequently recovering the variant from 
the resulting culture broth. This is described in detail further below. 

First aspect of the invention 

In a preferred embodiment of the method according to the first aspect of the
invention the property of the parent enzyme to be modified is selected from calcium
dependency, substrate binding, cleavage pattern, pH dependent activity and the like. Specific
examples of how to change these properties of a parent Termamyl-like α-amylase are given
further below.

In another preferred embodiment the parent Termamyl-like α-amylase to be
modified is a B. licheniformis α-amylase.

Second and third aspects of the invention 

One important advantage of the methods according to the second and third
aspects of the present invention is that it is possible to adapt the structure (or a structural
part) of a Termamyl-like α-amylase to the structure (or structural part) of a non-Termamyl-like
α-amylase and vice versa. For instance, having identified a loop structure of the
non-Termamyl-like α-amylase which is believed to be responsible for or contributing to a
particular property of the non-Termamyl-like α-amylase, it is possible to replace the
corresponding structure of the Termamyl-like α-amylase with said non-Termamyl-like
α-amylase structure or, if no corresponding structure exists in the Termamyl-like α-amylase,
to insert the structure into the Termamyl-like α-amylase in such a manner that the resulting
variant Termamyl-like α-amylase, as far as the relevant part is concerned, resembles the
corresponding part of the non-Termamyl-like α-amylase. When two or more parts of the
structure of the parent Termamyl-like α-amylase are modified so as to resemble the
corresponding parts of the non-Termamyl-like α-amylase, it is possible to increase the
resemblance of the Termamyl-like α-amylase variant to the non-Termamyl-like α-amylase
and thus to alter the properties of said variant in the direction of those of said
non-Termamyl-like α-amylase. Loop modifications are discussed in more detail further below.

Typically, the modification to be performed in step iii) of the method
according to the second aspect of the invention is accomplished by deleting one or more
amino acid residues of the part of the Termamyl-like α-amylase to be modified so as to adapt
the structure of said part of the parent α-amylase to the corresponding part of the
non-Termamyl-like α-amylase; by replacing one or more amino acid residues of the part of the
Termamyl-like α-amylase to be modified with the amino acid residues occupying
corresponding positions in the non-Termamyl-like α-amylase; or by insertion of one or more
amino acid residues present in the non-Termamyl-like α-amylase into a corresponding
position in the Termamyl-like α-amylase. For the method according to the third aspect the
modification is to be understood analogously, performed on the non-Termamyl-like parent
α-amylase rather than the Termamyl-like α-amylase.

In step ii) of the method according to the second or third aspect of the
invention the part of the structure to be identified is preferably one which in the folded
enzyme is believed to be in contact with the substrate (cf. the disclosure above in the section
entitled "Substrate-binding site") or involved in substrate specificity and/or cleavage pattern,
and/or one which is in contact with one of the calcium or sodium ions, and/or one which is
contributing to the pH or temperature profile of the enzyme, or one which otherwise, from
structural or functional considerations, is contemplated to be responsible for differences in
one or more properties of the Termamyl-like and non-Termamyl-like α-amylase.

Non-Termamyl-like α-amylase

The non-Termamyl-like α-amylase with which the comparison is made in step
i) of the method of the second aspect of the invention, and which is the parent α-amylase in
the method of the third aspect of the invention, may be any α-amylase which does not
belong to the family of Termamyl-like α-amylases (as defined above) and which, as a
consequence thereof, has a different three-dimensional structure. Furthermore, the
non-Termamyl-like α-amylase should be one which has, at the time that the method is performed,
an elucidated or contemplated three-dimensional structure.

The non-Termamyl-like α-amylase may, e.g., be a fungal α-amylase, a
mammalian or a plant α-amylase or a bacterial α-amylase (different from a Termamyl-like
α-amylase). Specific examples of such α-amylases include the Aspergillus oryzae TAKA
α-amylase, the A. niger acid α-amylase, the Bacillus subtilis α-amylase, the porcine pancreatic
α-amylase and a barley α-amylase. All of these α-amylases have elucidated structures which
are clearly different from the structure of the Termamyl-like α-amylase shown herein.

The fungal α-amylases mentioned above, i.e. derived from A. niger and A.
oryzae, are highly homologous on the amino acid level and generally considered to belong
to the same family of α-amylases. In the present disclosure, this family is termed
"Fungamyl-like α-amylase", which is intended to indicate an α-amylase which exhibits a high
homology, i.e. more than 70%, such as 80%, homology (as defined herein) to the fungal
α-amylase derived from Aspergillus oryzae, commercially available as Fungamyl®, and the
A. niger α-amylase.

From the enclosed illustrations of the α-amylase structure of a Termamyl-like
α-amylase and a comparison of said structure with the structure of a Fungamyl-like
α-amylase it is evident that major differences exist between the two structures. In the method
of the invention it is of particular interest to modify parts of the parent Termamyl-like
α-amylase which belong to a region with large differences to the Fungamyl-like α-amylase.
In particular, it is of interest to modify the parent Termamyl-like α-amylase in one or more
of the following loops: loop 1, loop 2, loop 3 and/or loop 8 of the parent α-amylase.

In the method of the third aspect of the invention it is of particular interest to
modify loop 1, loop 2, loop 3 and/or loop 8 of the parent non-Termamyl-like α-amylase to
a closer resemblance to the similar loops of a Termamyl-like α-amylase, such as Termamyl.

In the following, specific types of variants are described which have been
designed by use of the method of the invention.

Loop modifications 

In order to change the substrate specificity of the parent α-amylase to be
modified it is relevant to consider loop modifications. For instance, by changing one or more
of the loop structures of the Termamyl-like α-amylase into a closer resemblance with the
corresponding loop structure(s) of a non-Termamyl-like α-amylase (such as a Fungamyl-like
α-amylase), it is contemplated that it is possible to change the substrate specificity in the
direction of that of the non-Termamyl-like α-amylase. In the following, different types of loop
modifications of interest are listed. It will be understood that the variants may have other
changed properties in addition to the modified substrate specificity. It will be understood that
the following modifications identified for a specific Termamyl-like α-amylase are intended
to include corresponding modifications in other equivalent positions of other Termamyl-like
α-amylases. Furthermore, it will be understood that, normally, the loop modification will
comprise replacement of an entire loop structure or a substantial part thereof in, e.g., the
Termamyl-like α-amylase, with the corresponding loop structure (or substantial part thereof)
in a non-Termamyl-like α-amylase.

Loop 2 modifications

In one embodiment the invention relates to a variant of a parent Termamyl-like
α-amylase, in which variant at least one amino acid residue of the parent α-amylase, which
is/are present in a fragment corresponding to the amino acid fragment 44-57 of the amino
acid sequence of SEQ ID No. 4, i.e. loop 2, has been deleted or replaced with one or more
amino acid residues which is/are present in a fragment corresponding to the amino acid
fragment 66-84 of the amino acid sequence shown in SEQ ID No. 10, or in which one or
more additional amino acid residues has been added using the relevant part of SEQ ID No.
10 or a corresponding part of another Fungamyl-like α-amylase as a template.

The amino acid sequence shown in SEQ ID No. 10 is the amino acid sequence
of the A. oryzae α-amylase, i.e. a Fungamyl-like α-amylase. It will be understood that
amino acid residues or fragments found in corresponding positions in other α-amylases, in
particular Fungamyl-like α-amylases, may be used as a template for the construction of the
variant according to the invention. The corresponding part in other homologous α-amylases
may easily be identified on the basis of a comparison of the amino acid sequences and/or
three-dimensional structures of the respective α-amylases.

For instance, the variant may be one, which, when the amino acid sequence
of the variant is aligned most closely with the amino acid sequence of the said parent
α-amylase, occupies the same position as the portion from residue X to residue Y of SEQ ID
No. 4, the said region having at least 80%, such as at least 90%, sequence homology with the
part of SEQ ID No. 10 extending from residue Z to residue V of SEQ ID No. 10, wherein X
is the amino acid residue occupying position 44, 45, 46, 47 or 48 of SEQ ID No. 4, Y is the
amino acid residue occupying position 51, 52, 53, 54, 55, 56 or 57 of SEQ ID No. 4, Z is
the amino acid residue occupying position 66, 67, 68, 69 or 70 of SEQ ID No. 10, and V
is the amino acid residue occupying position 78, 79, 80, 81, 82, 83 or 84 of SEQ ID No.
10.

In other words, the variant may be one in which an amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 44-57
of SEQ ID No. 4, has been replaced with an amino acid fragment Z-V, which corresponds
to or is within the amino acid fragment 66-84 of the amino acid sequence shown in SEQ ID
No. 10, in which X, Y, Z and V have the meanings indicated above.
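
The replacement of a fragment X-Y of the parent with a fragment Z-V of the template can be sketched at the sequence level as follows. This is a minimal Python illustration, assuming 1-based inclusive residue numbering; the function name `swap_fragment` and the two short sequences are hypothetical stand-ins and are not SEQ ID No. 4 or SEQ ID No. 10.

```python
def swap_fragment(parent, x, y, template, z, v):
    """Replace parent residues x..y with template residues z..v
    (all positions 1-based and inclusive)."""
    if not 1 <= x <= y <= len(parent):
        raise ValueError("parent fragment X-Y out of range")
    if not 1 <= z <= v <= len(template):
        raise ValueError("template fragment Z-V out of range")
    return parent[:x - 1] + template[z - 1:v] + parent[y:]

# Hypothetical sequences, for illustration only.
parent = "AAAAPPPPAAAA"      # the fragment to be replaced is residues 5-8
template = "GGGGGLLLLLLLGG"  # the donor fragment is residues 6-12

variant = swap_fragment(parent, 5, 8, template, 6, 12)
print(variant)  # AAAALLLLLLLAAAA
```

Because the two fragments need not have the same length, the variant may be longer or shorter than the parent; here a 4-residue fragment is replaced with a 7-residue fragment, giving a net addition of 3 residues.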

A specific example of a variant according to this embodiment is a variant of
a parent Termamyl-like α-amylase, in which the amino acid fragment of the parent
α-amylase, which corresponds to amino acid residues 48-51 of SEQ ID No. 4, has been
replaced with an amino acid fragment corresponding to amino acid residues 70-78 of the
amino acid sequence shown in SEQ ID No. 10.

Loop 3 modifications - limited alteration

In another embodiment the invention relates to a variant of a parent Termamyl-like
α-amylase, in which variant at least one of the amino acid residues of the parent
α-amylase, which is/are present in an amino acid fragment corresponding to the amino acid
fragment 195-202 of the amino acid sequence of SEQ ID No. 4, has been deleted or replaced
with one or more of the amino acid residues which is/are present in an amino acid fragment
corresponding to the amino acid fragment 165-177 of the amino acid sequence shown in SEQ
ID No. 10, or in which one or more additional amino acid residues has been added using the
relevant part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like α-amylase
as a template.

For instance, the variant may be one in which an amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 195-202
of SEQ ID No. 4, has been replaced by an amino acid fragment Z-V, which corresponds to
or is within the amino acid fragment 165-177 of the amino acid sequence shown in SEQ ID
No. 10, in which X is an amino acid residue corresponding to the amino acid occupying
position 195 or 196 of SEQ ID No. 4, Y is an amino acid residue corresponding to the
amino acid occupying position 198, 199, 200, 201 or 202 of SEQ ID No. 4, Z is an amino
acid residue corresponding to the amino acid occupying position 165 or 166 of SEQ ID No.
10, and V is an amino acid residue corresponding to the amino acid occupying position 173,
174, 175, 176 or 177 of SEQ ID No. 10.

Expressed in another manner, the variant according to this aspect may be one,
which, when the amino acid sequence of the variant is aligned most closely with the amino acid
sequence of the said parent Termamyl-like α-amylase, occupies the same position as the
portion from residue X to residue Y of SEQ ID No. 4, the said region having at least 80%,
such as 90%, sequence homology with the part of SEQ ID No. 10 extending from residue Z
to residue V of SEQ ID No. 10, the meaning of X, Y, Z and V being as identified above.
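
A percentage criterion of this kind can be checked once the region of the variant and the template fragment Z-V have been aligned. The following Python sketch uses plain percent identity over the aligned fragment as a stand-in; this is an assumption made for illustration and is not necessarily the homology measure defined elsewhere in this specification. The function name `percent_identity` and the two example fragments are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding identical residues.
    Both arguments are rows of one alignment ('-' marks a gap);
    columns containing a gap never count as a match."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("aligned fragments must be non-empty and of equal length")
    matches = sum(1 for a, b in zip(aligned_a, aligned_b) if a == b and a != "-")
    return 100.0 * matches / len(aligned_a)

# Hypothetical aligned fragments, for illustration only.
variant_region = "YADIDYDH"
template_region = "YADVDYDH"

identity = percent_identity(variant_region, template_region)
print(identity)          # 87.5
print(identity >= 80.0)  # True
```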

A specific example of a variant according to this embodiment is a variant of
a parent Termamyl-like α-amylase, in which the amino acid fragment of the parent
α-amylase, which corresponds to amino acid residues 196-198 of SEQ ID No. 4, has been
replaced with the amino acid fragment corresponding to amino acid residues 166-173 of the
amino acid sequence shown in SEQ ID No. 10.

Loop 3 modifications - complete domain B

In a further embodiment the invention relates to a variant of a parent
Termamyl-like α-amylase, in which variant at least one of the amino acid residues of the
parent α-amylase, which is/are present in a fragment corresponding to the amino acid
fragment 117-185 of the amino acid sequence of SEQ ID No. 4, has/have been deleted or
replaced with one or more of the amino acid residues which is/are present in an amino acid
fragment corresponding to the amino acid fragment 98-210 of the amino acid sequence shown
in SEQ ID No. 10, or in which one or more additional amino acid residues has been added
using the relevant part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like
α-amylase as a template.

For instance, the variant may be one in which an amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 117-185
of SEQ ID No. 4, has been replaced with an amino acid fragment Z-V, which corresponds
to or is within the amino acid fragment 98-210 of the amino acid sequence shown in SEQ
ID No. 10, in which variant X is an amino acid residue corresponding to the amino acid
occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 4, Y is an amino acid residue
corresponding to the amino acid occupying position 181, 182, 183, 184 or 185 of SEQ ID
No. 4, Z is an amino acid residue corresponding to the amino acid occupying position 98,
99, 100, 101 or 102 of SEQ ID No. 10, and V is an amino acid residue corresponding to the
amino acid occupying position 206, 207, 208, 209 or 210 of SEQ ID No. 10.

A specific example of a variant according to this embodiment is a variant of
a parent α-amylase, in which an amino acid fragment of the parent α-amylase, which
corresponds to amino acid residues 121-181 of SEQ ID No. 4, has been replaced with the
amino acid fragment corresponding to amino acid residues 102-206 of the amino acid
sequence shown in SEQ ID No. 10.

In another embodiment the invention relates to a variant of a parent Termamyl-like
α-amylase, in which variant at least one of the amino acid residues of the parent
α-amylase, which is/are present in a fragment corresponding to the amino acid fragment
117-181 of the amino acid sequence of SEQ ID No. 4, has/have been deleted or replaced with
one or more of the amino acid residues which is/are present in an amino acid fragment
corresponding to the amino acid fragment 98-206 of the amino acid sequence shown in
SEQ ID No. 10, or in which one or more additional amino acid residues has been added
using the relevant part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like
α-amylase as a template.

For instance, the variant may be one in which the amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 117-177
of SEQ ID No. 4, has been replaced with an amino acid fragment Z-V, which
corresponds to or is within the amino acid fragment 98-202 of the amino acid sequence
shown in SEQ ID No. 10, in which variant X is an amino acid residue corresponding to the
amino acid occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 4, Y is an amino
acid residue corresponding to the amino acid occupying position 174, 175, 176 or 177 of
SEQ ID No. 4, Z is an amino acid residue corresponding to the amino acid occupying
position 98, 99, 100, 101 or 102 of SEQ ID No. 10, and V is an amino acid residue
corresponding to the amino acid occupying position 199, 200, 201 or 202 of SEQ ID No.
10.

A specific example of a variant according to this embodiment of the invention
is a variant in which the amino acid fragment of the parent α-amylase, which corresponds
to amino acid residues 121-174 of SEQ ID No. 4, has been replaced with the amino acid
fragment corresponding to amino acid residues 102-199 of the amino acid sequence shown
in SEQ ID No. 10.

Loop 1 modifications - minimal addition 

In a further embodiment the present invention relates to a variant of a parent
Termamyl-like α-amylase, in which variant at least one of the amino acid residues of the
parent α-amylase, which is/are present in an amino acid fragment corresponding to the amino
acid fragment 12-19 of the amino acid sequence of SEQ ID No. 4, has/have been deleted or
replaced with one or more of the amino acid residues which is/are present in an amino acid
fragment which corresponds to the amino acid fragment 28-42 of SEQ ID No. 10, or in
which one or more additional amino acid residues has/have been inserted using the relevant
part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like α-amylase as a
template.

For instance, the variant may be one in which the amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 12-19
of SEQ ID No. 4, has been replaced with an amino acid fragment Z-V, which
corresponds to or is within the amino acid fragment 28-42 of the amino acid sequence shown
in SEQ ID No. 10, in which variant X is an amino acid residue corresponding to the amino
acid occupying position 12, 13 or 14 of SEQ ID No. 4, Y is an amino acid residue
corresponding to the amino acid occupying position 15, 16, 17, 18 or 19 of SEQ ID No. 4,
Z is an amino acid residue corresponding to the amino acid occupying position 28, 29, 30,
31 or 32 of SEQ ID No. 10, and V is an amino acid residue corresponding to the amino
acid occupying position 38, 39, 40, 41 or 42 of SEQ ID No. 10.

A specific example of a variant according to this aspect of the invention is a
variant in which the amino acid fragment of the parent α-amylase, which corresponds to
amino acid residues 14-15 of SEQ ID No. 4, has been replaced with the amino acid fragment
corresponding to amino acid residues 32-38 of the amino acid sequence shown in SEQ ID
No. 10.

Loop 1 modifications - complete loop

In a further embodiment the invention relates to a variant of a parent
Termamyl-like α-amylase, in which variant at least one of the amino acid residues of the
parent α-amylase, which is present in a fragment corresponding to amino acid residues 7-23
of the amino acid sequence of SEQ ID No. 4, has/have been deleted or replaced with one
or more amino acid residues which is/are present in an amino acid fragment corresponding
to amino acid residues 13-45 of the amino acid sequence shown in SEQ ID No. 10, or in
which one or more additional amino acid residues has/have been inserted using the relevant
part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like α-amylase as a
template.

For instance, the variant may be one in which the amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 7-23 of
SEQ ID No. 4, has been replaced with an amino acid fragment Z-V, which corresponds
to or is within the amino acid fragment 13-45 of the amino acid sequence shown in SEQ ID
No. 10, in which variant X is an amino acid residue corresponding to the amino acid
occupying position 7 or 8 of SEQ ID No. 4, Y is an amino acid residue corresponding to the
amino acid occupying position 18, 19, 20, 21, 22 or 23 of SEQ ID No. 4, Z is an amino
acid residue corresponding to the amino acid occupying position 13 or 14 of SEQ ID No. 10,
and V is an amino acid residue corresponding to the amino acid occupying position 40, 41,
42, 43, 44 or 45 of SEQ ID No. 10.

A specific variant according to this embodiment is one in which the amino
acid fragment of the parent α-amylase, which corresponds to amino acid residues 8-18 of
SEQ ID No. 4, has been replaced with the amino acid fragment corresponding to amino acid
residues 14-40 of the amino acid sequence shown in SEQ ID No. 10.

Loop 8 modifications 

In a further embodiment the invention relates to a variant of a parent
Termamyl-like α-amylase, in which variant at least one of the amino acid residues of the
parent α-amylase, which is present in a fragment corresponding to amino acid residues
322-346 of the amino acid sequence of SEQ ID No. 2, has/have been deleted or replaced with
one or more amino acid residues which is/are present in an amino acid fragment
corresponding to amino acid residues 291-313 of the amino acid sequence shown in SEQ ID
No. 10, or in which one or more additional amino acid residues has/have been inserted using
the relevant part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like
α-amylase as a template.

For instance, the variant may be one in which the amino acid fragment X-Y
of the parent α-amylase, which corresponds to or is within the amino acid fragment 322-346
of SEQ ID No. 2, has been replaced with an amino acid fragment Z-V, which
corresponds to or is within the amino acid fragment 291-313 of the amino acid sequence
shown in SEQ ID No. 10, in which variant X is an amino acid residue corresponding to the
amino acid occupying position 322, 323, 324 or 325 of SEQ ID No. 2, Y is an amino acid
residue corresponding to the amino acid occupying position 343, 344, 345 or 346 of SEQ ID
No. 2, Z is an amino acid residue corresponding to the amino acid occupying position 291,
292, 293 or 294 of SEQ ID No. 10, and V is an amino acid residue corresponding to the
amino acid occupying position 310, 311, 312 or 313 of SEQ ID No. 10.

A specific variant according to this aspect of the invention is one, in which the 
amino acid fragment of the parent a-amylase, which corresponds to amino acid residues 325-
345 of SEQ ID No. 2, has been replaced with the amino acid fragment corresponding to
amino acid residues 294-313 of the amino acid sequence shown in SEQ ID No. 10. 
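
The fragment replacement described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `replace_fragment` is hypothetical, the sequences are toy strings (not SEQ ID No. 2 or SEQ ID No. 10), and the positions X, Y, Z and V are assumed to have already been mapped onto each sequence's own 1-based numbering.

```python
def replace_fragment(parent: str, donor: str,
                     x: int, y: int, z: int, v: int) -> str:
    """Replace parent residues x..y with donor residues z..v.

    All positions are 1-based and inclusive, as in the text.
    """
    if not (1 <= x <= y <= len(parent)):
        raise ValueError("parent fragment X-Y out of range")
    if not (1 <= z <= v <= len(donor)):
        raise ValueError("donor fragment Z-V out of range")
    return parent[:x - 1] + donor[z - 1:v] + parent[y:]


# Toy sequences for illustration only (hypothetical, not real amylases).
parent = "AAAAPPPPPAAAA"
donor = "GGGDDDGGG"

# Replace parent residues 5-9 with donor residues 4-6.
hybrid = replace_fragment(parent, donor, x=5, y=9, z=4, v=6)
print(hybrid)  # AAAADDDAAAA
```

With real sequences, the specific variant above would correspond to a call with x=325, y=345, z=294, v=313. Note that the two fragments need not have the same length, so the variant may be shorter or longer than the parent.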

Ca 2+ dependency

It is highly desirable to be able to decrease the Ca 2+ dependency of a 
Termamyl-like a-amylase. Accordingly, in a further aspect the invention relates to a variant 
of a parent Termamyl-like a-amylase, which exhibits a-amylase activity and which has a 
decreased Ca 2+ dependency as compared to the parent a-amylase. The decreased Ca 2+ 
dependency has the functional result that the variant exhibits a satisfactory amylolytic activity 
in the presence of a lower concentration of calcium ion in the extraneous medium than is 
necessary for the parent enzyme and, for example, therefore is less sensitive than the parent 
to calcium ion-depleting conditions such as those obtained in media containing calcium- 
complexing agents (such as certain detergent builders). 

The decreased Ca 2+ dependency of the variant of the invention may
advantageously be achieved by increasing the Ca 2+ binding affinity of the parent Termamyl-like
a-amylase; in other words, the stronger the Ca 2+ binding of the enzyme, the lower the
Ca 2+ dependency.

It is presently believed that amino acid residues located within 10 A from a 
sodium or calcium ion are involved in or are of importance for the Ca 2+ binding capability 
of the enzyme. 

Accordingly, the variant according to this aspect of the invention is preferably 
one, which has been modified in one or more amino acid residues present within 10 A from 
a calcium and/or sodium ion identified in the three-dimensional Termamyl-like a-amylase
structure in such a manner that the affinity of the a-amylase for calcium is increased. 

The amino acid residues found within a distance of 10 A from the Ca 2+ 
binding sites of the B. licheniformis a-amylase with the amino acid sequence SEQ ID NO
2 were determined as described in Example 2 and are as follows: 

V102, I103, N104, H105, K106, R125, W155, W157, Y158, H159, F160, D161, G162,
T163, Y175, K176, F177, G178, K180, A181, W182, D183, W184, E185, V186, S187,
N192, Y193, D194, Y195, L196, M197, Y198, A199, D200, I201, D202, Y203, D204,
H205, P206, V208, A209, D231, A232, V233, K234, H235, I236, K237, F238, F240,
L241, A294, A295, S296, T297, Q298, G299, G300, G301, Y302, D303, M304, R305,
K306, L307, W342, F343, L346, Q393, Y394, Y396, H405, H406, D407, I408, V409,
R413, E414, G415, D416, S417, V419, A420, N421, S422, G423, L424, I428, T429,
D430, G431, P432, V440, G441, R442, Q443, N444, A445, G446, E447, T448, W449,
I462, G475, Y480, V481, Q482, R483.
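
A distance-based selection of this kind can be sketched generically in Python. Example 2 is not reproduced in this section, so the following is not the method used there. It is a minimal illustration that assumes atom coordinates (in angstroms) are available as plain tuples; the function name `residues_within` and the coordinates are hypothetical.

```python
import math


def residues_within(atoms, ions, cutoff):
    """Return sorted residue labels having any atom within `cutoff` of any ion.

    atoms: iterable of (residue_label, (x, y, z))
    ions:  iterable of (x, y, z)
    """
    hits = set()
    for label, xyz in atoms:
        if any(math.dist(xyz, ion) <= cutoff for ion in ions):
            hits.add(label)
    return sorted(hits)


# Hypothetical coordinates, chosen only to give round distances.
atoms = [
    ("N104", (1.0, 2.0, 2.0)),   # 3 A from the ion
    ("H105", (6.0, 0.0, 0.0)),   # 6 A
    ("K106", (0.0, 9.0, 0.0)),   # 9 A
    ("L61", (12.0, 0.0, 0.0)),   # 12 A
]
ions = [(0.0, 0.0, 0.0)]

for cutoff in (10.0, 8.0, 5.0):
    print(cutoff, residues_within(atoms, ions, cutoff))
```

The same function covers narrower shells (for example 8 A or 5 A) simply by changing `cutoff`.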

In order to construct a variant according to this aspect of the invention it is 
desirable to replace at least one of the above mentioned amino acid residues (or an amino 
acid residue occupying an equivalent position in another Termamyl-like a-amylase than that 
defined by SEQ ID NO 2), which is contemplated to be involved in providing a non-optimal 
calcium binding, with any other amino acid residue which improves the Ca 2+ binding affinity 
of the variant enzyme. In practice, the identification and subsequent modification of the 
amino acid residue is performed by the following method: 

i) identifying an amino acid residue within 10 A from a Ca 2+ binding site of 
a Termamyl-like a-amylase structure, which from structural or functional considerations is 
believed to be responsible for a non-optimal calcium ion interaction, 

ii) constructing a variant in which said amino acid residue is replaced with 
another amino acid residue which from structural or functional considerations is believed to 
be important for establishing a higher Ca 2+ binding affinity, and testing the Ca 2+ dependency 
of the resulting Termamyl-like a-amylase variant. 

In the present context, the term "non-optimal calcium ion interaction" is 
intended to indicate that the amino acid residue in question is selected on the basis of a 
presumption that substituting said amino acid residue for another may improve a calcium ion 
binding interaction of the enzyme. For instance, the amino acid residue in question may be 
selected on the basis of one or more of the following considerations: 

- to obtain an improved interaction between a calcium ion and an amino acid 
residue located near to the surface of the enzyme (as identified from the structure of the 
Termamyl-like a-amylase). For instance, if the amino acid residue in question is exposed
to a surrounding solvent, it may be advantageous to increase the shielding of said amino acid 
residue from the solvent so as to provide for a stronger interaction between said amino acid 
residue and a calcium ion. This can be achieved by replacing said residue (or an amino acid 
residue in the vicinity of said residue contributing to the shielding) by an amino acid residue 
which is more bulky or otherwise results in an improved shielding effect. 

- to stabilize a calcium binding site, for instance by stabilizing the structure 
of the Termamyl-like a-amylase (e.g. by stabilizing the contacts between the A, B and C 
domains or stabilizing one or more of the domains as such). This may, e.g., be achieved 
by providing for a better coordination to amino acid side chains, which may, e.g., be
obtained by replacing an N residue with a D residue and/or a Q residue with an E residue
(e.g. N104D), e.g. within 10 A, and preferably within 3 or 4 A, of a calcium binding site.

- to protect the calcium binding site or to improve the coordination between
the calcium ion and the calcium binding site, e.g. by providing a stronger interaction between
the ion and the binding site.
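
The second consideration above (replacing N with D, or Q with E, near a calcium binding site) lends itself to a simple enumeration. The sketch below is illustrative only: the function name `carboxylate_candidates` is hypothetical, and the output is a list of candidates to evaluate in the structure, not a statement that each substitution improves calcium binding.

```python
# Amide-to-carboxylate replacements named in the text: N -> D and Q -> E.
ISOSTERIC = {"N": "D", "Q": "E"}


def carboxylate_candidates(residues):
    """Map residues such as 'N104' to candidate substitutions such as 'N104D'."""
    out = []
    for res in residues:
        aa, pos = res[0], res[1:]
        if aa in ISOSTERIC:
            out.append(f"{aa}{pos}{ISOSTERIC[aa]}")
    return out


# A few residues taken from the 10 A list given earlier in this section.
near_calcium = ["V102", "I103", "N104", "H105", "Q298", "N421", "Q443"]
print(carboxylate_candidates(near_calcium))
# ['N104D', 'Q298E', 'N421D', 'Q443E']
```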

Before actually constructing a Termamyl-like a-amylase variant according to 
the above principles it may be convenient to evaluate the contemplated amino acid 
modification by its accommodation into the Termamyl-like a-amylase structure, e.g. into a 
model structure of the parent Termamyl-like a-amylase.

Preferably, the amino acid residue to be modified is located within 8 A of a
Ca 2+ binding site residue, such as within 5 A of such residue. The amino acid residues
within 8 A and 5 A, respectively, may easily be identified by a method analogous to that used for
identifying amino acid residues within 10 A (cf. Example 2).

The following mutation is contemplated to be of particular interest with respect
to decreasing the Ca 2+ dependency of a Termamyl-like a-amylase:

N104D (of the B. licheniformis a-amylase SEQ ID NO 2, or an equivalent (N to D) mutation
of an equivalent position in another Termamyl-like a-amylase).

In connection with substitutions of relevance for Ca 2+ dependency, some other
substitutions appear to be of importance in stabilizing the enzyme conformation (for instance
the Domains A-B and/or Domains A-C interactions contributing to the overall stability of the
enzyme) in that they may, e.g., enhance the strength of binding or retention of calcium ion
or sodium ion at or within a calcium or sodium binding site, respectively, within the parent
Termamyl-like a-amylase.

It is desirable to stabilize the C-domain in order to increase the calcium
stability and/or thermostability of the enzyme. In this connection the stabilization may result
in a stabilization of the binding of calcium by the enzyme, and an improved contact between
the C-domain and the A-domain (of importance for thermostability). The latter may be
achieved by introduction of cysteine bridges, salt bridges or increased hydrogen, hydrophobic
and/or electrostatic interactions.

For instance, the C-domain of the B. licheniformis a-amylase having the amino
acid sequence shown in SEQ ID No. 2 may be stabilized by introduction of a cysteine bridge
between domain A and domain C, e.g. by introducing the following mutations:
A349C+I479C and/or L346C+I430C.

A salt bridge may be obtained by introduction of the following mutations: 

N457D,E 
N457D,E+K385R 
F350D,E+I430R,K 
F350D,E+I411R,K
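
The compact mutation notation used in these lists can be expanded mechanically. The sketch below assumes the usual reading of the notation, namely that a comma separates alternative replacement residues at one position and a plus sign joins mutations that are combined in one variant. The function name `expand` is hypothetical.

```python
import itertools
import re

# One site: wild-type letter, position, then one or more comma-separated letters.
_SITE = re.compile(r"^([A-Z])(\d+)([A-Z](?:,[A-Z])*)$")


def expand(notation: str):
    """Expand e.g. 'F350D,E+I430R,K' into every concrete combination."""
    per_site = []
    for site in notation.split("+"):
        m = _SITE.match(site.strip())
        if m is None:
            raise ValueError(f"unrecognized site notation: {site!r}")
        wild, pos, alts = m.groups()
        per_site.append([f"{wild}{pos}{a}" for a in alts.split(",")])
    return ["+".join(combo) for combo in itertools.product(*per_site)]


print(expand("N457D,E"))
# ['N457D', 'N457E']
print(expand("F350D,E+I430R,K"))
# ['F350D+I430R', 'F350D+I430K', 'F350E+I430R', 'F350E+I430K']
```

Under this reading, each salt bridge entry with two sites and two alternatives per site stands for four concrete double mutants.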

The calcium site of Domain C may be stabilized by replacing the amino acid 
residues H408 and/or G303 with any other amino acid residue. Of particular interest are the
following mutations:
H408Q,E,N,D and/or G303N,D,Q,E 

which are contemplated to provide a better calcium binding or protection from calcium 
depletion. 

Similar mutations may be introduced in equivalent positions of other 
Termamyl-like a-amylases. 

Other substitution mutations (relative to B. licheniformis a-amylase, SEQ ID 
No. 2) which appear to be of importance, inter alia, in the context of reducing calcium 
dependency include the following: R23K, H156Y, A181T, A209V and G310D (or equivalent 
mutations in equivalent positions in another Termamyl-like a-amylase). Substitutions of 
R214 and P345 with other amino acids may also be of importance in this connection. 

Variants with altered activity at higher/lower pH 

It is contemplated that it is possible to change the pH optima of a Termamyl- 
like a-amylase or the enzymatic activity at a given pH by changing the pKa of the active site 
residues. This may be achieved, e.g. by changing the electrostatic interaction or hydrophobic 
interaction between functional groups of amino acid side chains of the amino acid residue to 
be modified and of its close surroundings. This may, e.g., be accomplished by the following 
method: 

i) identifying, in a structure of the Termamyl-like a-amylase in question, an
amino acid residue within 15 A from an active site residue, in particular 10 A from an active
site residue, which amino acid residue is contemplated to be involved in electrostatic or
hydrophobic interactions with an active site residue,

ii) replacing, in the structure, said amino acid residue with an amino acid 
residue which changes the electrostatic and/or hydrophobic surroundings of an active site 
residue and evaluating the accommodation of the amino acid residue in the structure, 

iii) optionally repeating step i) and/or ii) until an amino acid replacement has 
been identified which is accommodated into the structure, 

iv) constructing a Termamyl-like a-amylase variant resulting from steps i), ii) 
and optionally iii) and testing the pH dependent enzymatic activity of interest of said variant. 

In the above method it may be of particular relevance to add a positively 
charged residue within 5 A of a glutamate (thereby lowering the pKa of the glutamate from 
about 4.5 to 4), or to add a negatively charged residue within 5 A of a glutamate (thereby 
increasing the pKa to about 5), or to make similar modifications within a distance of about 
5 A of a Histidine. 
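
The practical effect of such a pKa shift can be illustrated with the Henderson-Hasselbalch relation, under which the deprotonated fraction of an acidic group at a given pH is 1 / (1 + 10^(pKa - pH)). The following sketch is a textbook calculation added for illustration, not part of the method described above; the function name is hypothetical and the pKa values are the approximate figures quoted in the preceding paragraph.

```python
def fraction_deprotonated(ph: float, pka: float) -> float:
    """Henderson-Hasselbalch: fraction of an acid present as its conjugate base."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))


# Compare the glutamate pKa values mentioned in the text at a fixed pH of 4.5.
for pka in (4.0, 4.5, 5.0):
    print(f"pKa {pka}: {fraction_deprotonated(4.5, pka):.2f} deprotonated at pH 4.5")
# pKa 4.0: 0.76 deprotonated at pH 4.5
# pKa 4.5: 0.50 deprotonated at pH 4.5
# pKa 5.0: 0.24 deprotonated at pH 4.5
```

A shift of half a pH unit in pKa thus changes the ionization state of the residue appreciably near the pKa, which is why nearby charged residues can move the pH-activity profile.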

In a further aspect the invention relates to a variant of a Termamyl-like a- 
amylase which exhibits a higher activity at a lower pH (e.g. compared to the pH optimum) 
than the parent a-amylase. In particular, the variant comprises a mutation of an amino acid 
residue corresponding to at least one of the following positions of the B. licheniformis a- 
amylase (SEQ ID NO 2): 

E336, Q333, P331, I236, V102, A232, I103, L196

The following mutations are of particular interest: 

E336R,K;
Q333R,K;
P331R,K;
V102R,K,A,T,S,G;
I236K,R,N;
I103K,R;
L196K,R;
A232T,S,G;

or any combination of two or more of these variants or any combination of one or more of 
these variants with any of the other variants disclosed herein. 

In a still further aspect the invention relates to a variant of a Termamyl-like
a-amylase which has a higher activity at a higher pH than the parent a-amylase. In
particular, the variant comprises a mutation of an amino acid residue corresponding to at 
least one of the following positions of the B. licheniformis a-amylase (SEQ ID NO 2): 
N326, H281, Y273

In particular, the variant comprises a mutation corresponding to at least one 
of the following mutations of the B. licheniformis a-amylase (SEQ ID NO 2): 
N326I,Y,F,L,V 
H281F,I,L 
Y273F,W 

or any combination of two or more of these variants or any combination of one or more of 
these variants with any of the other variants disclosed herein. 

A mutation which appears to be of importance in relation to the specific activity
of variants of the invention is a mutation corresponding to the substitution S187D in B. 
licheniformis a-amylase (SEQ ID NO 2). 

Variants with increased thermostability and/or altered temperature optimum

In a further desired aspect the invention relates to a variant of a parent 
Termamyl-like a-amylase, which variant is the result of one or more amino acid residues
having been deleted from, replaced or added to the parent a-amylase so as to obtain an 
increased thermostability of the variant. 

The Termamyl-like a-amylase structure contains a number of unique internal 
holes, which may contain water, and a number of crevices. In order to increase the 
thermostability of the a-amylase it may be desirable to reduce the number of holes and 
crevices (or reduce the size of the holes or crevices), e.g. by introducing one or more 
hydrophobic contacts, preferably achieved by introducing bulkier residues, in the vicinity or 
surroundings of the hole. For instance, the amino acid residues to be modified are those 
which are involved in the formation of the hole. 

Accordingly, in a further aspect the present invention relates to a method of 
increasing the thermostability and/or altering the temperature optimum of a parent Termamyl- 
like a-amylase, which method comprises 



i) identifying an internal hole or a crevice of the parent Termamyl-like a- 
amylase in the three-dimensional structure of said a-amylase, 

ii) replacing, in the structure, one or more amino acid residues in the
neighborhood of the hole or crevice identified in i) with another amino acid residue which
from structural or functional considerations is believed to increase the hydrophobic
interaction and to fill out or reduce the size of the hole or crevice,

iii) constructing a Termamyl-like a-amylase variant resulting from step ii) and 
testing the thermostability and/or temperature optimum of the variant. 

The structure used for identifying the hole or crevice of the parent
Termamyl-like a-amylase may be the structure identified in Appendix 1 or a model structure of the
parent Termamyl-like a-amylase built thereon.

It will be understood that the hole or crevice is identified by the amino acid
residues surrounding the hole/crevice, and that modification of said amino acid residues is
of importance for filling or reducing the size of the hole/crevice. The particular amino acid
residues referred to below are those which in the crystal structure have been found to flank the
hole/crevice in question.

In order to fill (completely or partly) a major hole located between domain A
and B, mutation to any other amino acid residue of an amino acid residue corresponding to
one or more of the following residues of the B. licheniformis a-amylase (SEQ ID NO 2) is
contemplated:

L61, Y62, F67, K106, G145, I212, S151, R214, Y150, F143, R146

Of particular interest is a mutation to a more bulky amino acid residue than 
the amino acid residue of the parent enzyme. 

Of particular interest is a variant of a Termamyl-like a-amylase which
comprises a mutation corresponding to the following mutations (using the numbering of B.
licheniformis a-amylase (SEQ ID NO 2)):

L61W,V,F;
Y62W;
F67W;
K106R,F,W;
G145F,W;
I212F,L,W,Y,R,K;
S151 replaced with any other amino acid residue and in particular with F,W,I or L;
R214W;
Y150R,K;
F143W; and/or
R146W.
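
Whether a given replacement is "more bulky" can be screened crudely by comparing residue volumes. The sketch below is an illustration only, under loud assumptions: the volumes are approximate literature values for amino acid residues in cubic angstroms (as commonly tabulated after Zamyatnin), quoted here for illustration rather than taken from this specification, and the function name `is_bulkier` is hypothetical. Volume ignores shape, charge and local packing, so it is no substitute for evaluating the replacement in the structure.

```python
# Approximate residue volumes in cubic angstroms (illustrative literature values).
VOLUME = {
    "G": 60.1, "A": 88.6, "S": 89.0, "C": 108.5, "D": 111.1,
    "P": 112.7, "N": 114.1, "T": 116.1, "E": 138.4, "V": 140.0,
    "Q": 143.8, "H": 153.2, "M": 162.9, "I": 166.7, "L": 166.7,
    "K": 168.6, "R": 173.4, "F": 189.9, "Y": 193.6, "W": 227.8,
}


def is_bulkier(mutation: str) -> bool:
    """True if the replacement residue has a larger tabulated volume.

    `mutation` is a single substitution such as 'L61W'.
    """
    wild, new = mutation[0], mutation[-1]
    return VOLUME[new] > VOLUME[wild]


for m in ("L61W", "L61F", "L61V", "G145W", "F67W"):
    print(m, is_bulkier(m))
```

By this measure some listed replacements (for example L61V) are not bulkier than the parent residue, which suggests that the listed mutations reflect structural considerations beyond volume alone.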

In order to fill a hole in the vicinity of the active site, mutation to any other
amino acid residue of an amino acid residue corresponding to one or more of the following
residues of the B. licheniformis a-amylase (SEQ ID NO 2) is contemplated:
L241, I236.

Of interest is a mutation to a more bulky amino acid residue. 

Of particular interest is a variant of a Termamyl-like a-amylase which 
comprises a mutation corresponding to one or more of the following mutations in the B. 
licheniformis a-amylase: 
L241I,F,Y,W; and/or 
I236L,F,W,Y. 

In order to fill a hole in the vicinity of the active site, mutation to any other
amino acid residue of an amino acid residue corresponding to one or more of the following 
residues of the B. licheniformis a-amylase (SEQ ID NO 2) is contemplated: 
L7, V259, F284 

Of interest is a mutation to a more bulky amino acid residue. 

Of particular interest is a variant of a Termamyl-like a-amylase which 
comprises a mutation corresponding to one or more of the following mutations in the B. 
licheniformis a-amylase: 
L7F,I,W
V259F,I,L 
F284W 

In order to fill a hole in the vicinity of the active site, mutation to any other
amino acid residue of an amino acid residue corresponding to one or more of the following 
residues of the B. licheniformis a-amylase (SEQ ID NO 2) is contemplated: 
F350, F343 

Of interest is a mutation to a more bulky amino acid residue. 

Of particular interest is a variant of a Termamyl-like a-amylase which
comprises a mutation corresponding to one or more of the following mutations in the B. 
licheniformis a-amylase: 
F350W 
F343W 

In order to fill a hole in the vicinity of the active site, mutation to any other
amino acid residue of an amino acid residue corresponding to one or more of the following
residues of the B. licheniformis a-amylase (SEQ ID NO 2) is contemplated:
I427, V481

Of interest is a mutation to a more bulky amino acid residue. 

Of particular interest is a variant of a Termamyl-like a-amylase which 
comprises a mutation corresponding to one or more of the following mutations in the B. 
licheniformis a-amylase: 
I427F,L,W
V481F,I,L,W

Variants with an altered cleavage pattern 

In the starch liquefaction process it is desirable to use an a-amylase which is
capable of degrading the starch molecules into long branched oligosaccharides (like, e.g. the
Fungamyl-like a-amylases) rather than shorter branched oligosaccharides (like conventional
Termamyl-like a-amylases). The very small branched oligosaccharides (panose
precursors) produced by the latter cannot be hydrolyzed properly by pullulanases, which in the
liquefaction process are used after the a-amylases and before the amyloglucosidases. Thus, in
the presence of panose precursors the action of glucoamylase ends up with a high degree of the
small branched limiting-dextrin, the trisaccharide panose. The presence of panose lowers the
saccharification yield significantly and is thus undesirable.

Thus, one aim of the present invention is to change the degradation 
characteristics of a Termamyl-like a-amylase to those of a Fungamyl-like a-amylase without
at the same time reducing the thermostability of the Termamyl-like a-amylase. 

Accordingly, in a further aspect the invention relates to a variant of a 
Termamyl-like a-amylase which has a reduced ability to cleave a substrate close to the 
branching point. 

The variant may suitably be constructed by a method which comprises

i) identifying the substrate binding area of the parent Termamyl-like a-amylase
in a model of the three-dimensional structure of said a-amylase (e.g. within a sphere of 4 A
from the substrate binding site, as defined in the section above entitled "Substrate Binding
Site"),

ii) replacing, in the model, one or more amino acid residues of the substrate 
binding area of the cleft identified in i), which is/are believed to be responsible for the 
cleavage pattern of the parent a-amylase, with another amino acid residue which from 
structural considerations is believed to result in an altered substrate cleavage pattern, or 
deleting one or more amino acid residues of the substrate binding area contemplated to 
introduce favorable interactions to the substrate or adding one or more amino acid residues 
to the substrate binding area contemplated to introduce favorable interactions to the substrate, 
and 

iii) constructing a Termamyl-like a-amylase variant resulting from step ii) and 
testing the substrate cleavage pattern of the variant. 

Of particular interest is a variant which cleaves an amylopectin substrate, from 
the reducing end, more than one glucose unit from the branching point, preferably more than 
two or three glucose units from the branching point, i.e. at a further distance from the 
branching point than that obtained by use of a wild type B. licheniformis a-amylase.

Residues of particular interest in connection with this aspect of the invention 
correspond to the following residues of the B. licheniformis a-amylase (SEQ ID NO 2): V54, 
D53, Y56, Q333, G57, and the variants according to this aspect preferably comprise a
mutation in one or more of these residues. 

In particular, the variant comprises at least one of the following mutations, 
which are expected to prevent cleavage close to the branching point: 
V54L,I,F,Y,W,R,K,H,E,Q
D53L,I,F,Y,W
Y56W
Q333W
G57: all possible amino acid residues
A52: amino acid residues larger than A, e.g. A52W,Y,L,F,I.

Variants of a fungal a-amylase

In a still further embodiment the invention relates to a variant of a parent
Fungamyl-like a-amylase, in which variant at least one of the amino acid residues of the
parent a-amylase, which is/are present in an amino acid fragment corresponding to amino
acid residues 291-313 of the amino acid sequence of SEQ ID No. 10, has/have been deleted
or replaced with one or more of the amino acid residues, which is/are present in an amino
acid fragment corresponding to amino acid residues 98-210 of the amino acid sequence
shown in SEQ ID No. 4, or in which one or more additional amino acid residues has/have
been inserted using the relevant part of SEQ ID No. 4 or a corresponding part of another
Termamyl-like a-amylase as a template.

For instance, the variant may be one, in which the amino acid fragment X-Y
of the parent a-amylase, which corresponds to or is within the amino acid fragment 117-185
of SEQ ID No. 10, has/have been replaced with an amino acid fragment Z-V, which
corresponds to or is within the amino acid fragment 98-210 of the amino acid sequence
shown in SEQ ID No. 4, in which variant, X is an amino acid residue corresponding to the
amino acid occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 10, Y is an amino
acid residue corresponding to the amino acid occupying position 181, 182, 183, 184 or 185
of SEQ ID No. 10, Z is an amino acid residue corresponding to the amino acid occupying
position 98, 99, 100, 101 or 102 of SEQ ID No. 4, and V is an amino acid residue
corresponding to the amino acid occupying position 206, 207, 208, 209 or 210 of SEQ ID
No. 4.

A specific example of a variant according to this aspect of the invention is one,
in which the amino acid fragment of the parent a-amylase, which corresponds to amino acid
residues 121-181 of SEQ ID No. 10, has been replaced with the amino acid fragment
corresponding to amino acid residues 102-206 of the amino acid sequence shown in SEQ ID
No. 4.

Another example of a variant according to this aspect of the invention is one,
in which the amino acid fragment of the parent a-amylase, which corresponds to amino acid
residues 121-174 of SEQ ID No. 10, has been replaced with the amino acid fragment
corresponding to amino acid residues 102-199 of the amino acid sequence shown in SEQ ID
No. 4.

In a further embodiment the invention relates to a variant of a parent
Fungamyl-like a-amylase, in which an amino acid fragment corresponding to amino acid 
residues 181-184 of the amino acid sequence shown in SEQ ID No. 10 has been deleted. 

General mutations in variants of the invention

It may be preferred that the variant of the invention or prepared in accordance
with the method of the invention comprises one or more modifications in addition to those
outlined above. Thus, it may be advantageous that one or more proline residues present in
the part of the a-amylase variant having been modified is/are replaced with a non-proline
residue which may be any of the possible, naturally occurring non-proline residues, and
which preferably is an alanine, glycine, serine, threonine, valine or leucine.

Analogously, it may be preferred that one or more cysteine residues present
in the amino acid residues with which the parent a-amylase is modified are replaced with a
non-cysteine residue such as serine, alanine, threonine, glycine, valine or leucine.

Furthermore, the variant of the invention may either as the only modification
or in combination with any of the above outlined modifications be modified so that one or
more Asp and/or Glu present in an amino acid fragment corresponding to the amino acid
fragment 185-209 of SEQ ID No. 8 is replaced by an Asn and/or Gln, respectively. Also
of interest is a modification in which one or more of the Lys residues present in the
Termamyl-like a-amylase, in an amino acid fragment corresponding to the amino acid
fragment 185-209 of SEQ ID No. 8, is replaced by an Arg.

It will be understood that in accordance with the present invention variants 
may be prepared which carry two or more of the above outlined modifications. For instance, 
variants may be prepared which comprise a modification in the loop 1 and loop 2 region,
a modification in loop 2 and limited loop 3, a modification in loop 1, loop 2, loop 3 and loop 
8, etc. 

Furthermore, it may be advantageous to introduce point-mutations in any of 
the variants described herein. 

Methods of preparing a-amylase variants

Several methods for introducing mutations into genes are known in the art. 
After a brief discussion of the cloning of a-amylase-encoding DNA sequences, methods for 
generating mutations at specific sites within the a-amylase-encoding sequence will be 
discussed.



Cloning a DNA sequence encoding an a-amylase 

The DNA sequence encoding a parent a-amylase may be isolated from any cell
or microorganism producing the a-amylase in question, using various methods well known
in the art. First, a genomic DNA and/or cDNA library should be constructed using
chromosomal DNA or messenger RNA from the organism that produces the a-amylase to
be studied. Then, if the amino acid sequence of the a-amylase is known, homologous,
labelled oligonucleotide probes may be synthesized and used to identify a-amylase-encoding
clones from a genomic library prepared from the organism in question. Alternatively, a
labelled oligonucleotide probe containing sequences homologous to a known a-amylase gene
could be used as a probe to identify a-amylase-encoding clones, using hybridization and
washing conditions of lower stringency.

Yet another method for identifying a-amylase-encoding clones would involve
inserting fragments of genomic DNA into an expression vector, such as a plasmid,
transforming a-amylase-negative bacteria with the resulting genomic DNA library, and then
plating the transformed bacteria onto agar containing a substrate for a-amylase, thereby
allowing clones expressing the a-amylase to be identified.

Alternatively, the DNA sequence encoding the enzyme may be prepared
synthetically by established standard methods, e.g. the phosphoramidite method described
by S.L. Beaucage and M.H. Caruthers (1981) or the method described by Matthes et al.
(1984). In the phosphoramidite method, oligonucleotides are synthesized, e.g. in an
automatic DNA synthesizer, purified, annealed, ligated and cloned in appropriate vectors.

Finally, the DNA sequence may be of mixed genomic and synthetic origin,
mixed synthetic and cDNA origin or mixed genomic and cDNA origin, prepared by ligating
fragments of synthetic, genomic or cDNA origin (as appropriate, the fragments
corresponding to various parts of the entire DNA sequence), in accordance with standard
techniques. The DNA sequence may also be prepared by polymerase chain reaction (PCR)
using specific primers, for instance as described in US 4,683,202 or R.K. Saiki et al. (1988).

Site-directed mutagenesis 

Once an a-amylase-encoding DNA sequence has been isolated, and desirable
sites for mutation identified, mutations may be introduced using synthetic oligonucleotides.
These oligonucleotides contain nucleotide sequences flanking the desired mutation sites;
mutant nucleotides are inserted during oligonucleotide synthesis. In a specific method, a
single-stranded gap of DNA, bridging the a-amylase-encoding sequence, is created in a
vector carrying the a-amylase gene. Then the synthetic nucleotide, bearing the desired
mutation, is annealed to a homologous portion of the single-stranded DNA. The remaining
gap is then filled in with DNA polymerase I (Klenow fragment) and the construct is ligated
using T4 ligase. A specific example of this method is described in Morinaga et al. (1984).
US 4,760,025 discloses the introduction of oligonucleotides encoding multiple mutations by
performing minor alterations of the cassette. However, an even greater variety of mutations
can be introduced at any one time by the Morinaga method, because a multitude of
oligonucleotides, of various lengths, can be introduced.
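
The idea of an oligonucleotide that carries the mutant codon between wild-type flanking sequence can be sketched as follows. This is a minimal illustration, not a primer design tool: the function name `mutagenic_oligo`, the flank length and the toy coding sequence are hypothetical, and practical considerations (oligonucleotide length, melting temperature, strand choice) are ignored.

```python
def mutagenic_oligo(coding_dna: str, codon_number: int,
                    new_codon: str, flank: int = 9) -> str:
    """Return an oligo with `new_codon` at `codon_number` (1-based),
    flanked by up to `flank` wild-type nucleotides on each side."""
    if len(new_codon) != 3:
        raise ValueError("new_codon must be exactly three nucleotides")
    start = 3 * (codon_number - 1)
    if start < 0 or start + 3 > len(coding_dna):
        raise ValueError("codon_number outside the coding sequence")
    left = coding_dna[max(0, start - flank):start]
    right = coding_dna[start + 3:start + 3 + flank]
    return left + new_codon.upper() + right


# Toy coding sequence (hypothetical): ATG GCT AAC CAT AAA GGT = M A N H K G
cds = "ATGGCTAACCATAAAGGT"

# Change codon 3 from AAC (Asn) to GAC (Asp), an N to D substitution.
oligo = mutagenic_oligo(cds, codon_number=3, new_codon="GAC", flank=6)
print(oligo)  # ATGGCTGACCATAAA
```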

Another method of introducing mutations into a-amylase-encoding DNA
sequences is described in Nelson and Long (1989). It involves the 3-step generation of a
PCR fragment containing the desired mutation introduced by using a chemically synthesized
DNA strand as one of the primers in the PCR reactions. From the PCR-generated fragment,
a DNA fragment carrying the mutation may be isolated by cleavage with restriction
endonucleases and reinserted into an expression plasmid.

Random mutagenesis

Random mutagenesis is suitably performed either as localized or region- 
specific random mutagenesis in at least three parts of the gene translating to the amino acid 
sequence shown in question, or within the whole gene. 

For region-specific random mutagenesis with a view to improving the thermal
stability of a parent Termamyl-like a-amylase, codon positions corresponding to the
following amino acid residues of the B. licheniformis a-amylase (SEQ ID NO 2) may
appropriately be targeted:

To improve the stability of the calcium site between Domain A and C:
I428-A435
T297-L308
F403-V409

To improve the stability between domain A and B:
D180-D204
H156-T163
A232-F238

With a view to achieving improved binding of a substrate (i.e. improved
binding of a carbohydrate species, such as amylose or amylopectin) by a Termamyl-like
α-amylase variant, modified (e.g. higher) substrate specificity and/or modified (e.g. higher)
specificity with respect to cleavage (hydrolysis) of substrate, it appears that the following
codon positions for the amino acid sequence shown in SEQ ID NO 2 (or equivalent codon
positions for another parent Termamyl-like α-amylase in the context of the invention) may
particularly appropriately be targeted:

13-18

50-56

70-76

102-109

163-172

189-199

229-235

360-264

327-335
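The residue ranges listed above can be converted into nucleotide intervals when primers or doped oligonucleotides are designed. The following is a minimal sketch only. It assumes that residue 1 of the mature protein is encoded by the first codon of the coding sequence supplied, and the function name `residue_range_to_nucleotides` is hypothetical rather than part of the described method.

```python
import re


def residue_range_to_nucleotides(region):
    """Map a residue range such as 'I428-A435' or '13-18' to the 1-based,
    inclusive nucleotide interval covered by the corresponding codons.

    Assumption: residue 1 is encoded by nucleotides 1-3 of the sequence.
    """
    match = re.fullmatch(r"[A-Z]?(\d+)-[A-Z]?(\d+)", region.strip())
    if match is None:
        raise ValueError(f"unrecognized region: {region!r}")
    first, last = int(match.group(1)), int(match.group(2))
    if first > last:
        raise ValueError(f"descending range: {region!r}")
    # Codon n spans nucleotides 3n-2 through 3n.
    return 3 * first - 2, 3 * last


if __name__ == "__main__":
    for region in ["I428-A435", "T297-L308", "F403-V409", "13-18"]:
        start, end = residue_range_to_nucleotides(region)
        print(f"{region}: nucleotides {start}-{end}")
```

A descending range is rejected rather than silently reordered, so a transcription error in a list of target regions is reported instead of producing a wrong interval.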

The random mutagenesis of a DNA sequence encoding a parent α-amylase to
be performed in accordance with step a) of the above-described method of the invention may
conveniently be performed by use of any method known in the art.

For instance, the random mutagenesis may be performed by use of a suitable
physical or chemical mutagenizing agent, by use of a suitable oligonucleotide, or by
subjecting the DNA sequence to PCR-generated mutagenesis. Furthermore, the random
mutagenesis may be performed by use of any combination of these mutagenizing agents.

The mutagenizing agent may, e.g., be one which induces transitions, 
transversions, inversions, scrambling, deletions, and/or insertions. 

Examples of a physical or chemical mutagenizing agent suitable for the present
purpose include ultraviolet (UV) irradiation, hydroxylamine,
N-methyl-N'-nitro-N-nitrosoguanidine (MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane sulphonate
(EMS), sodium bisulphite, formic acid, and nucleotide analogues.

When such agents are used, the mutagenesis is typically performed by 
incubating the DNA sequence encoding the parent enzyme to be mutagenized in the presence 
of the mutagenizing agent of choice under suitable conditions for the mutagenesis to take 
place, and selecting for mutated DNA having the desired properties. 

When the mutagenesis is performed by the use of an oligonucleotide, the 
oligonucleotide may be doped or spiked with the three non-parent nucleotides during the 
synthesis of the oligonucleotide at the positions which are to be changed. The doping or 
spiking may be done so that codons for unwanted amino acids are avoided. The doped or 
spiked oligonucleotide can be incorporated into the DNA encoding the amylolytic enzyme 
by any published technique, using e.g. PCR, LCR or any DNA polymerase and ligase. 
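The doping described above can be illustrated with a small simulation. This is a sketch under stated assumptions, not the synthesis protocol itself: the parent sequence, the doping fraction and both function names are hypothetical, and the three non-parent nucleotides are assumed to be incorporated with equal probability. Checking for unwanted codons after the fact only mimics the outcome; in an actual synthesis the nucleotide mix at each position would be chosen in advance.

```python
import random

NUCLEOTIDES = "ACGT"
STOP_CODONS = {"TAA", "TAG", "TGA"}  # example of codons one may wish to avoid


def dope_oligo(parent, positions, non_parent_fraction, rng):
    """Draw one oligonucleotide from a simulated doped synthesis.

    At each 0-based index in `positions` the parent nucleotide is kept with
    probability 1 - non_parent_fraction; otherwise one of the three
    non-parent nucleotides is chosen with equal probability.
    """
    if not 0.0 <= non_parent_fraction <= 1.0:
        raise ValueError("non_parent_fraction must lie between 0 and 1")
    out = list(parent.upper())
    for i in positions:
        if rng.random() < non_parent_fraction:
            alternatives = [n for n in NUCLEOTIDES if n != out[i]]
            out[i] = rng.choice(alternatives)
    return "".join(out)


def has_unwanted_codon(oligo, unwanted, frame=0):
    """Return True if any in-frame codon of `oligo` is in `unwanted`."""
    codons = (oligo[i:i + 3] for i in range(frame, len(oligo) - 2, 3))
    return any(codon in unwanted for codon in codons)


if __name__ == "__main__":
    rng = random.Random(1)
    parent = "ATGGCTAACGGT"  # hypothetical parent fragment
    target = [3, 4, 5]       # dope the second codon only
    pool = [dope_oligo(parent, target, 0.1, rng) for _ in range(1000)]
    kept = [o for o in pool if not has_unwanted_codon(o, STOP_CODONS)]
    print(len(kept), "of", len(pool), "simulated oligonucleotides retained")
```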

When PCR-generated mutagenesis is used, either a chemically treated or
non-treated gene encoding a parent α-amylase enzyme is subjected to PCR under conditions that
increase the misincorporation of nucleotides (Deshler 1992; Leung et al., Technique, Vol. 1,
1989, pp. 11-15).

A mutator strain of E. coli (Fowler et al., Molec. Gen. Genet., 133, 1974, pp.
179-191), S. cerevisiae or any other microbial organism may be used for the random
mutagenesis of the DNA encoding the amylolytic enzyme by e.g. transforming a plasmid
encoding the parent enzyme into the mutator strain, growing the mutator strain with the
plasmid and isolating the mutated plasmid from the mutator strain. The mutated plasmid may
subsequently be transformed into the expression organism.

The DNA sequence to be mutagenized may conveniently be present in a 
genomic or cDNA library prepared from an organism expressing the parent amylolytic 
enzyme. Alternatively, the DNA sequence may be present on a suitable vector such as a 
plasmid or a bacteriophage, which as such may be incubated with or otherwise exposed to 
the mutagenizing agent. The DNA to be mutagenized may also be present in a host cell
either by being integrated in the genome of said cell or by being present on a vector harbored 
in the cell. Finally, the DNA to be mutagenized may be in isolated form. It will be 
understood that the DNA sequence to be subjected to random mutagenesis is preferably a 
cDNA or a genomic DNA sequence. 

In some cases it may be convenient to amplify the mutated DNA sequence
prior to the expression step (b) or the screening step (c) being performed. Such amplification
may be performed in accordance with methods known in the art, the presently preferred
method being PCR-generated amplification using oligonucleotide primers prepared on the
basis of the DNA or amino acid sequence of the parent enzyme.

Subsequent to the incubation with or exposure to the mutagenizing agent, the
mutated DNA is expressed by culturing a suitable host cell carrying the DNA sequence under
conditions allowing expression to take place. The host cell used for this purpose may be one
which has been transformed with the mutated DNA sequence, optionally present on a vector,
or one which carried the DNA sequence encoding the parent enzyme during the
mutagenesis treatment. Examples of suitable host cells are the following: gram positive
bacteria such as Bacillus subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis,
Bacillus stearothermophilus, Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus
coagulans, Bacillus circulans, Bacillus lautus, Bacillus megaterium, Bacillus thuringiensis,
Streptomyces lividans or Streptomyces murinus; and gram negative bacteria such as E. coli.

The mutated DNA sequence may further comprise a DNA sequence encoding 
functions permitting expression of the mutated DNA sequence. 

Localized random mutagenesis: the random mutagenesis may advantageously be localized to
a part of the parent α-amylase in question. This may, e.g., be advantageous when certain
regions of the enzyme have been identified to be of particular importance for a given
property of the enzyme, and when modified are expected to result in a variant having
improved properties. Such regions may normally be identified when the tertiary structure
of the parent enzyme has been elucidated and related to the function of the enzyme.

The localized random mutagenesis is conveniently performed by use of
PCR-generated mutagenesis techniques as described above or any other suitable technique known
in the art.

Alternatively, the DNA sequence encoding the part of the DNA sequence to
be modified may be isolated, e.g. by being inserted into a suitable vector, and said part may
subsequently be subjected to mutagenesis by use of any of the mutagenesis methods discussed
above.

With respect to the screening step in the above-mentioned method of the
invention, this may conveniently be performed by use of a filter assay based on the following
principle:

A microorganism capable of expressing the mutated amylolytic enzyme of 
interest is incubated on a suitable medium and under suitable conditions for the enzyme to 
be secreted, the medium being provided with a double filter comprising a first protein- 
binding filter and on top of that a second filter exhibiting a low protein binding capability. 
The microorganism is located on the second filter. Subsequent to the incubation, the first 
filter comprising enzymes secreted from the microorganisms is separated from the second 
filter comprising the microorganisms. The first filter is subjected to screening for the desired 
enzymatic activity and the corresponding microbial colonies present on the second filter are 
identified. 

The filter used for binding the enzymatic activity may be any protein binding 
filter e.g. nylon or nitrocellulose. The top filter carrying the colonies of the expression 
organism may be any filter that has no or low affinity for binding proteins e.g. cellulose 
acetate or Durapore™. The filter may be pretreated with any of the conditions to be used 
for screening or may be treated during the detection of enzymatic activity. 

The enzymatic activity may be detected by a dye, fluorescence, precipitation, 
pH indicator, IR-absorbance or any other known technique for detection of enzymatic 
activity. 

The detecting compound may be immobilized by any immobilizing agent e.g.
agarose, agar, gelatine, polyacrylamide, starch, filter paper, cloth; or any combination of
immobilizing agents.

α-Amylase activity is detected by Cibacron Red labelled amylopectin, which
is immobilized on agarose. For screening for variants with increased thermal and high-pH
stability, the filter with bound α-amylase variants is incubated in a buffer at pH 10.5 and 60°C
or 65°C for a specified time, rinsed briefly in deionized water and placed on the
amylopectin-agarose matrix for activity detection. Residual activity is seen as lysis of
Cibacron Red by amylopectin degradation. The conditions are chosen to be such that activity
due to the α-amylase having the amino acid sequence shown in SEQ ID No. 1 can barely be
detected. Stabilized variants show, under the same conditions, increased color intensity due
to increased liberation of Cibacron Red.
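The comparison against the barely detectable parent signal can be expressed as a simple selection rule. The sketch below is illustrative only. It assumes that color intensities have been quantified per grid position on the protein-binding filter and that the same positions identify colonies on the top filter. The function name, the intensity values and the cut-off ratio of 2.0 are hypothetical and are not taken from the description.

```python
def select_stabilized(signals, parent_signal, min_ratio=2.0):
    """Return grid positions whose residual signal is at least `min_ratio`
    times the signal obtained with the parent enzyme.

    signals: dict mapping (row, column) to a measured color intensity.
    parent_signal: intensity measured for the parent under the same conditions.
    """
    if parent_signal <= 0:
        raise ValueError("parent_signal must be positive")
    hits = [pos for pos, value in signals.items()
            if value / parent_signal >= min_ratio]
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical intensities after incubation at pH 10.5 and 60°C.
    readings = {(0, 0): 0.05, (0, 1): 0.30, (1, 0): 0.11, (1, 1): 0.04}
    print(select_stabilized(readings, parent_signal=0.05))
```

The positions returned are those at which the corresponding colonies on the second filter would be picked for further testing.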

For screening for variants with an activity optimum at a lower temperature
and/or over a broader temperature range, the filter with bound variants is placed directly on
the amylopectin-Cibacron Red substrate plate and incubated at the desired temperature (e.g.
4°C, 10°C or 30°C) for a specified time. After this time activity due to the α-amylase
having the amino acid sequence shown in SEQ ID No. 1 can barely be detected, whereas
variants with optimum activity at a lower temperature will show increased amylopectin lysis.
Prior to incubation on the amylopectin matrix, incubation in all kinds of desired media,
e.g. solutions containing Ca2+, detergents, EDTA or other relevant additives, can be carried
out in order to screen for changed dependency on, or for reaction of the variants in question
with, such additives.

Testing of variants of the invention 

The testing of variants of the invention may suitably be performed by 
determining the starch-degrading activity of the variant, for instance by growing host cells 
transformed with a DNA sequence encoding a variant on a starch-containing agarose plate 
and identifying starch-degrading host cells. Further testing as to altered properties (including 
specific activity, substrate specificity, cleavage pattern, thermoactivation, pH optimum, pH 
dependency, temperature optimum, and any other parameter) may be performed in 
accordance with methods known in the art. 

Expression of α-amylase variants

According to the invention, a DNA sequence encoding the variant produced 
by methods described above, or by any alternative methods known in the art, can be 
expressed, in enzyme form, using an expression vector which typically includes control 
sequences encoding a promoter, operator, ribosome binding site, translation initiation signal, 
and, optionally, a repressor gene or various activator genes. 

The recombinant expression vector carrying the DNA sequence encoding an
α-amylase variant of the invention may be any vector which may conveniently be subjected
to recombinant DNA procedures, and the choice of vector will often depend on the host cell
into which it is to be introduced. Thus, the vector may be an autonomously replicating
vector, i.e. a vector which exists as an extrachromosomal entity, the replication of which is
independent of chromosomal replication, e.g. a plasmid, a bacteriophage or an
extrachromosomal element, minichromosome or an artificial chromosome. Alternatively, the
vector may be one which, when introduced into a host cell, is integrated into the host cell
genome and replicated together with the chromosome(s) into which it has been integrated.

In the vector, the DNA sequence should be operably connected to a suitable
promoter sequence. The promoter may be any DNA sequence which shows transcriptional
activity in the host cell of choice and may be derived from genes encoding proteins either
homologous or heterologous to the host cell. Examples of suitable promoters for directing
the transcription of the DNA sequence encoding an α-amylase variant of the invention,
especially in a bacterial host, are the promoter of the lac operon of E. coli, the Streptomyces
coelicolor agarase gene dagA promoters, the promoters of the Bacillus licheniformis
α-amylase gene (amyL), the promoters of the Bacillus stearothermophilus maltogenic amylase
gene (amyM), the promoters of the Bacillus amyloliquefaciens α-amylase (amyQ), the
promoters of the Bacillus subtilis xylA and xylB genes etc. For transcription in a fungal
host, examples of useful promoters are those derived from the gene encoding A. oryzae
TAKA amylase, Rhizomucor miehei aspartic proteinase, A. niger neutral α-amylase, A. niger
acid stable α-amylase, A. niger glucoamylase, Rhizomucor miehei lipase, A. oryzae alkaline
protease, A. oryzae triose phosphate isomerase or A. nidulans acetamidase.

The expression vector of the invention may also comprise a suitable
transcription terminator and, in eukaryotes, polyadenylation sequences operably connected
to the DNA sequence encoding the α-amylase variant of the invention. Termination and
polyadenylation sequences may suitably be derived from the same sources as the promoter.

The vector may further comprise a DNA sequence enabling the vector to
replicate in the host cell in question. Examples of such sequences are the origins of
replication of plasmids pUC19, pACYC177, pUB110, pE194, pAMB1 and pIJ702.

The vector may also comprise a selectable marker, e.g. a gene the product of
which complements a defect in the host cell, such as the dal genes from B. subtilis or B.
licheniformis, or one which confers antibiotic resistance such as ampicillin, kanamycin,
chloramphenicol or tetracycline resistance. Furthermore, the vector may comprise Aspergillus
selection markers such as amdS, argB, niaD and sC, a marker giving rise to hygromycin
resistance, or the selection may be accomplished by co-transformation, e.g. as described in
WO 91/17243.

While intracellular expression may be advantageous in some respects, e.g.
when using certain bacteria as host cells, it is generally preferred that the expression is
extracellular. In general, the Bacillus α-amylases mentioned herein comprise a preregion
permitting secretion of the expressed enzyme into the culture medium. If desirable, this
preregion may be replaced by a different preregion or signal sequence, conveniently
accomplished by substitution of the DNA sequences encoding the respective preregions.

The procedures used to ligate the DNA construct of the invention encoding an
α-amylase variant, the promoter, terminator and other elements, respectively, and to insert
them into suitable vectors containing the information necessary for replication, are well
known to persons skilled in the art (cf., for instance, Sambrook et al. (1989)).

The cell of the invention, either comprising a DNA construct or an expression
vector of the invention as defined above, is advantageously used as a host cell in the
recombinant production of an α-amylase variant of the invention. The cell may be
transformed with the DNA construct of the invention encoding the variant, conveniently by
integrating the DNA construct (in one or more copies) in the host chromosome. This
integration is generally considered to be an advantage as the DNA sequence is more likely
to be stably maintained in the cell. Integration of the DNA constructs into the host
chromosome may be performed according to conventional methods, e.g. by homologous or
heterologous recombination. Alternatively, the cell may be transformed with an expression
vector as described above in connection with the different types of host cells.

The cell of the invention may be a cell of a higher organism such as a mammal
or an insect, but is preferably a microbial cell, e.g. a bacterial or a fungal (including yeast)
cell.

Examples of suitable bacteria are gram positive bacteria such as Bacillus
subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus,
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus, Bacillus megaterium, Bacillus thuringiensis, or Streptomyces lividans or
Streptomyces murinus, or gram negative bacteria such as E. coli. The transformation of the
bacteria may, for instance, be effected by protoplast transformation or by using competent
cells in a manner known per se.

The yeast organism may favorably be selected from a species of 
Saccharomyces or Schizosaccharomyces, e.g. Saccharomyces cerevisiae. The filamentous 
fungus may advantageously belong to a species of Aspergillus, e.g. Aspergillus oryzae or 
Aspergillus niger. Fungal cells may be transformed by a process involving protoplast 
formation and transformation of the protoplasts followed by regeneration of the cell wall in 
a manner known per se. A suitable procedure for transformation of Aspergillus host cells 
is described in EP 238 023. 

In a yet further aspect, the present invention relates to a method of producing 
an α-amylase variant of the invention, which method comprises cultivating a host cell as
described above under conditions conducive to the production of the variant and recovering 
the variant from the cells and/or culture medium. 

The medium used to cultivate the cells may be any conventional medium 
suitable for growing the host cell in question and obtaining expression of the α-amylase
variant of the invention. Suitable media are available from commercial suppliers or may be 
prepared according to published recipes (e.g. as described in catalogues of the American 
Type Culture Collection). 

The α-amylase variant secreted from the host cells may conveniently be
recovered from the culture medium by well-known procedures, including separating the cells 
from the medium by centrifugation or filtration, and precipitating proteinaceous components 
of the medium by means of a salt such as ammonium sulphate, followed by the use of 
chromatographic procedures such as ion exchange chromatography, affinity chromatography, 
or the like. 

Industrial Applications 

The α-amylase variants of this invention possess valuable properties allowing
for various industrial applications. In particular, the enzyme variants find potential
application as a component in washing, dishwashing and hard surface cleaning detergent
compositions, but they may also be useful in the production of sweeteners and ethanol from
starch and for textile desizing. Conditions for conventional starch converting processes and
liquefaction and/or saccharification processes are described in, for instance, US Patent No.
3,912,590 and EP patent publications Nos. 252,730 and 63,909.



Production of sweeteners from starch: A "traditional" process for conversion of starch to
fructose syrups normally consists of three consecutive enzymatic processes, viz. a
liquefaction process followed by a saccharification process and an isomerization process.
During the liquefaction process, starch is degraded to dextrins by an α-amylase (e.g.
Termamyl™) at pH values between 5.5 and 6.2 and at temperatures of 95-160°C for a period
of approx. 2 h. In order to ensure an optimal enzyme stability under these conditions, 1 mM
of calcium is added (40 ppm free calcium ions).
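The equivalence between 1 mM calcium and about 40 ppm follows from the atomic weight of calcium. The short sketch below makes the arithmetic explicit. It assumes that mg/l can be read as ppm, which holds for a dilute aqueous solution with a density of about 1 kg/l; a starch slurry is denser, so the figure is approximate. The function name and constant are illustrative and do not appear in the original text.

```python
CALCIUM_MOLAR_MASS = 40.078  # g/mol, standard atomic weight of calcium


def millimolar_to_ppm(conc_mM, molar_mass_g_per_mol):
    """Convert mmol/l to mg/l, read as ppm for a dilute aqueous solution."""
    # mmol/l multiplied by g/mol (equivalently mg/mmol) gives mg/l.
    return conc_mM * molar_mass_g_per_mol


if __name__ == "__main__":
    ppm = millimolar_to_ppm(1.0, CALCIUM_MOLAR_MASS)
    print(round(ppm, 1))  # prints 40.1
```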

After the liquefaction process the dextrins are converted into dextrose by
addition of a glucoamylase (e.g. AMG™) and a debranching enzyme, such as an isoamylase
or a pullulanase (e.g. Promozyme™). Before this step the pH is reduced to a value below
4.5, maintaining the high temperature (above 95°C), and the liquefying α-amylase activity
is denatured. The temperature is lowered to 60°C, and glucoamylase and debranching
enzyme are added. The saccharification process proceeds for 24-72 hours.

After the saccharification process the pH is increased to a value in the range
of 6-8, preferably pH 7.5, and the calcium is removed by ion exchange. The dextrose syrup
is then converted into high fructose syrup using, e.g., an immobilized glucose isomerase
(such as Sweetzyme™).

At least 3 enzymatic improvements of this process could be obtained. All
three improvements could be seen as individual benefits, but any combination (e.g. 1+2,
1+3, 2+3 or 1+2+3) could be employed:

Improvement 1. Reduction of the calcium dependency of the liquefying alpha-amylase.

Addition of free calcium is required to ensure adequately high stability of the
α-amylase, but free calcium strongly inhibits the activity of the glucose isomerase and needs
to be removed, by means of an expensive unit operation, to an extent which reduces the level
of free calcium to below 3-5 ppm. Cost savings could be obtained if such an operation could
be avoided and the liquefaction process could be performed without addition of free calcium
ions.

To achieve that, a less calcium-dependent Termamyl-like α-amylase which is
stable and highly active at low concentrations of free calcium (< 40 ppm) is required. Such
a Termamyl-like α-amylase should have a pH optimum at a pH in the range of 4.5-6.5,
preferably in the range of 4.5-5.5.

Improvement 2. Reduction of formation of unwanted Maillard products

The extent of formation of unwanted Maillard products during the liquefaction
process is dependent on the pH. Low pH favors reduced formation of Maillard products.
It would thus be desirable to be able to lower the process pH from around pH 6.0 to a value
around pH 4.5; unfortunately, none of the commonly known thermostable Termamyl-like α-amylases
is very stable at low pH (i.e. pH < 6.0), and their specific activity is generally low.

Achievement of the above-mentioned goal requires a Termamyl-like α-amylase
which is stable at low pH in the range of 4.5-5.5 and at free calcium concentrations in the
range of 0-40 ppm, and which maintains a high specific activity.

Improvement 3.

It has been reported previously (US patent 5,234,823) that when saccharifying
with A. niger glucoamylase and B. acidopullulyticus pullulanase, the presence of residual
α-amylase activity from the liquefaction process can lead to lower yields of dextrose if the
α-amylase is not inactivated before the saccharification stage. This inactivation can typically
be carried out by adjusting the pH to below 4.3 at 95°C, before lowering the temperature
to 60°C for saccharification.

The reason for this negative effect on dextrose yield is not fully understood,
but it is assumed that the liquefying α-amylase (for example Termamyl™ 120 L from B.
licheniformis) generates "limit dextrins" (which are poor substrates for B. acidopullulyticus
pullulanase) by hydrolyzing 1,4-alpha-glucosidic linkages close to and on both sides of the
branching points in amylopectin. Hydrolysis of these limit dextrins by glucoamylase leads
to a build-up of the trisaccharide panose, which is only slowly hydrolyzed by glucoamylase.

The development of a thermostable α-amylase which does not suffer from this
disadvantage would be a significant process improvement, as no separate inactivation step
would be required.

If a Termamyl-like, low-pH-stable α-amylase is developed, an alteration of the
specificity could be an advantage needed in combination with increased stability at low pH.

The methodology and principles of the present invention make it possible to 
design and produce variants according to the invention having the required properties as 
outlined above. 

Detergent Compositions 

According to the invention, the α-amylase may typically be a component of
a detergent composition. As such, it may be included in the detergent composition in the
form of a non-dusting granulate, a stabilized liquid, or a protected enzyme. Non-dusting
granulates may be produced, e.g. as disclosed in US 4,106,991 and 4,661,452 (both to Novo
Industri A/S) and may optionally be coated by methods known in the art. Examples of waxy
coating materials are poly(ethylene oxide) products (polyethyleneglycol, PEG) with mean
molar weights of 1000 to 20000; ethoxylated nonylphenols having from 16 to 50 ethylene
oxide units; ethoxylated fatty alcohols in which the alcohol contains from 12 to 20 carbon
atoms and in which there are 15 to 80 ethylene oxide units; fatty alcohols; fatty acids; and
mono- and di- and triglycerides of fatty acids. Examples of film-forming coating materials
suitable for application by fluid bed techniques are given in patent GB 1483591. Liquid
enzyme preparations may, for instance, be stabilized by adding a polyol such as propylene
glycol, a sugar or sugar alcohol, lactic acid or boric acid according to established methods.
Other enzyme stabilizers are well known in the art. Protected enzymes may be prepared
according to the method disclosed in EP 238,216.

The detergent composition of the invention may be in any convenient form, 
e.g. as powder, granules, paste or liquid. A liquid detergent may be aqueous, typically 
containing up to 70% of water and 0-30% of organic solvent, or nonaqueous. 

The detergent composition comprises one or more surfactants, each of which
may be anionic, nonionic, cationic, or zwitterionic. The detergent will usually contain
0-50% of anionic surfactant such as linear alkylbenzenesulfonate (LAS), alpha-olefinsulfonate
(AOS), alkyl sulfate (fatty alcohol sulfate) (AS), alcohol ethoxysulfate (AEOS or AES),
secondary alkanesulfonates (SAS), alpha-sulfo fatty acid methyl esters, alkyl- or
alkenylsuccinic acid or soap. It may also contain 0-40% of nonionic surfactant such as
alcohol ethoxylate (AEO or AE), carboxylated alcohol ethoxylates, nonylphenol ethoxylate,
alkylpolyglycoside, alkyldimethylamineoxide, ethoxylated fatty acid monoethanolamide, fatty
acid monoethanolamide, or polyhydroxy alkyl fatty acid amide (e.g. as described in WO
92/06154).

The detergent composition may additionally comprise one or more other 
enzymes, such as lipase, cutinase, protease, cellulase, peroxidase, e.g., laccase. 

The detergent may contain 1-65% of a detergent builder or complexing agent
such as zeolite, diphosphate, triphosphate, phosphonate, citrate, nitrilotriacetic acid (NTA),
ethylenediaminetetraacetic acid (EDTA), diethylenetriaminepentaacetic acid (DTMPA), alkyl-
or alkenylsuccinic acid, soluble silicates or layered silicates (e.g. SKS-6 from Hoechst). The
detergent may also be unbuilt, i.e. essentially free of detergent builder.

The detergent may comprise one or more polymers. Examples are
carboxymethylcellulose (CMC), poly(vinylpyrrolidone) (PVP), polyethyleneglycol (PEG),
poly(vinyl alcohol) (PVA), polycarboxylates such as polyacrylates, maleic/acrylic acid
copolymers and lauryl methacrylate/acrylic acid copolymers.

The detergent may contain a bleaching system which may comprise a H2O2
source such as perborate or percarbonate which may be combined with a peracid-forming 
bleach activator such as tetraacetylethylenediamine (TAED) or nonanoyloxybenzenesulfonate 
(NOBS). Alternatively, the bleaching system may comprise peroxy acids of e.g. the amide, 
imide, or sulfone type. 

The enzymes of the detergent composition of the invention may be stabilized 
using conventional stabilizing agents, e.g. a polyol such as propylene glycol or glycerol, a 
sugar or sugar alcohol, lactic acid, boric acid, or a boric acid derivative as e.g. an aromatic 
borate ester, and the composition may be formulated as described in e.g. WO 92/19709 and 
WO 92/19708. 

The detergent may also contain other conventional detergent ingredients such 
as e.g. fabric conditioners including clays, foam boosters, suds suppressors, anti-corrosion 
agents, soil-suspending agents, anti-soil redeposition agents, dyes, bactericides, optical 
brighteners, or perfume. 

The pH (measured in aqueous solution at use concentration) will usually be
neutral or alkaline, e.g. 7-11.
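The general ranges given above for surfactants, builder and pH can be used as a quick plausibility check on a draft formulation. The following is a minimal sketch under stated assumptions: the ranges are the "usual" values quoted in the text and are treated here as soft guidelines rather than limits, a builder content of zero is accepted because the detergent may be unbuilt, and the function name, dictionary keys and example formulation are hypothetical.

```python
# Typical ranges in % by weight, as quoted in the text above.
TYPICAL_RANGES = {
    "anionic surfactant": (0.0, 50.0),
    "nonionic surfactant": (0.0, 40.0),
    "builder": (1.0, 65.0),
}
ALLOW_ZERO = {"builder"}       # an unbuilt detergent contains no builder
TYPICAL_PH = (7.0, 11.0)       # neutral or alkaline at use concentration


def outside_typical_ranges(formulation, ph):
    """Return the names of entries that fall outside the typical ranges.

    formulation: dict mapping component class to % by weight; classes that
    are not listed are taken as 0%.
    """
    flagged = []
    for name, (low, high) in TYPICAL_RANGES.items():
        value = formulation.get(name, 0.0)
        if value == 0.0 and name in ALLOW_ZERO:
            continue
        if not low <= value <= high:
            flagged.append(name)
    if not TYPICAL_PH[0] <= ph <= TYPICAL_PH[1]:
        flagged.append("pH")
    return flagged


if __name__ == "__main__":
    draft = {"anionic surfactant": 10.0, "nonionic surfactant": 45.0}
    print(outside_typical_ranges(draft, ph=12.0))
```

A flagged entry is not necessarily wrong, since the text describes these values as usual rather than mandatory; the check only points to entries that merit a second look.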

Particular forms of detergent compositions within the scope of the invention
include:
1) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

Linear alkylbenzenesulfonate (calculated as acid)            7 - 12%
Alcohol ethoxysulfate (e.g. C12-18 alcohol, 1-2 EO
or alkyl sulfate (e.g. C16-18))                              [illegible]
Alcohol ethoxylate (e.g. C14-15 alcohol, 7 EO)               [illegible]
Sodium carbonate (as Na2CO3)                                 14 - [illegible]
Soluble silicate (as Na2O,2SiO2)                             [illegible]
Zeolite (as NaAlSiO4)                                        [illegible] - 11%
Sodium sulfate (as Na2SO4)                                   0 - 6%
Sodium citrate/citric acid (as C6H5Na3O7/C6H8O7)             0 - 15%
Sodium perborate (as NaBO3.H2O)                              11 - 18%
TAED                                                         2 - 6%
Carboxymethylcellulose                                       0 - 2%
Polymers (e.g. maleic/acrylic acid copolymer,
PVP, PEG)                                                    0 - 3%
Enzymes (calculated as pure enzyme protein)                  0.0001 - 0.1%
Minor ingredients (e.g. suds suppressors, perfume,
optical brightener, photobleach)                             0 - 5%


2) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

Linear alkylbenzenesulfonate (calculated as acid)            6 - 11%
Alcohol ethoxysulfate (e.g. C12-18 alcohol, 1-2 EO
or alkyl sulfate (e.g. C16-18))                              1 - 3%
Alcohol ethoxylate (e.g. C14-15 alcohol, 7 EO)               5 - 9%
Sodium carbonate (as Na2CO3)                                 15 - 21%
Soluble silicate (as Na2O,2SiO2)                             1 - 4%
Zeolite (as NaAlSiO4)                                        24 - 34%
Sodium sulfate (as Na2SO4)                                   4 - 10%
Sodium citrate/citric acid (as C6H5Na3O7/C6H8O7)             0 - 15%
Carboxymethylcellulose                                       0 - 2%
Polymers (e.g. maleic/acrylic acid copolymer,
PVP, PEG)                                                    1 - 6%
Enzymes (calculated as pure enzyme protein)                  0.0001 - 0.1%
Minor ingredients (e.g. suds suppressors,
perfume)                                                     0 - 5%


3) A detergent composition formulated as a granulate having a bulk density of at least
[illegible] g/l comprising

Linear alkylbenzenesulfonate (calculated as acid)            [illegible]
Alcohol ethoxylate (e.g. C[illegible]-15 alcohol, 7 EO)      [illegible]
Soap as fatty acid (e.g. C16-22 fatty acid)                  [illegible]
Sodium carbonate (as Na2CO3)                                 [illegible]
Soluble silicate (as Na2O,2SiO2)                             [illegible]
Zeolite (as NaAlSiO4)                                        [illegible]
Sodium sulfate (as Na2SO4)                                   [illegible]
Sodium perborate (as NaBO3.H2O)                              [illegible]
TAED                                                         2 - 8%
Phosphonate (e.g. EDTMPA)                                    0 - 1%
Carboxymethylcellulose                                       0 - 2%
Polymers (e.g. maleic/acrylic acid copolymer,
PVP, PEG)                                                    0 - 3%
Enzymes (calculated as pure enzyme protein)                  0.0001 - 0.1%
Minor ingredients (e.g. suds suppressors,
perfume, optical brightener)                                 0 - 5%


4) A detergent composition formulated as a granulate having a bulk density of at least
[illegible] g/l comprising

Linear alkylbenzenesulfonate (calculated as acid)            [illegible]
Alcohol ethoxylate (e.g. C12-15 alcohol, 7 EO)               [illegible]
Sodium carbonate (as Na2CO3)                                 [illegible]
Soluble silicate (as Na2O,2SiO2)                             [illegible]
Zeolite (as NaAlSiO4)                                        25 - 35%
Sodium sulfate (as Na2SO4)                                   0 - 10%
Carboxymethylcellulose                                       0 - 2%
Polymers (e.g. maleic/acrylic acid copolymer,
PVP, PEG)                                                    1 - 3%
Enzymes (calculated as pure enzyme protein)                  0.0001 - 0.1%
Minor ingredients (e.g. suds suppressors,
perfume)                                                     0 - 5%


5) An aqueous liquid detergent composition comprising 


Linear alkylbenzenesulfonate (calculated as acid)         15 - 21%
Alcohol ethoxylate (e.g. C12-15 alcohol, 7 EO or          12 - 18%
  C12-15 alcohol, 5 EO)
Soap as fatty acid (e.g. oleic acid)                      3 - 13%
Alkenylsuccinic acid (C12-14)                             0 - 13%
Aminoethanol                                              8 - 18%
Citric acid                                               2 - 8%
Phosphonate                                               0 - 3%
Polymers (e.g. PVP, PEG)                                  0 - 3%
Borate (as B4O7)                                          0 - 2%
Ethanol                                                   0 - 3%
Propylene glycol                                          8 - 14%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. dispersants, suds suppressors,    0 - 5%
  perfume, optical brightener)

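As a quick plausibility check on formulation 5, the sketch below sums the lower and upper limits of the listed ranges. It assumes the remainder of an aqueous composition is water; the helper names `range_totals` and `is_valid_recipe` are hypothetical and not part of the specification.

```python
# Ranges (% by weight) transcribed from formulation 5: (minimum, maximum).
FORMULATION_5 = {
    "Linear alkylbenzenesulfonate": (15, 21),
    "Alcohol ethoxylate": (12, 18),
    "Soap as fatty acid": (3, 13),
    "Alkenylsuccinic acid": (0, 13),
    "Aminoethanol": (8, 18),
    "Citric acid": (2, 8),
    "Phosphonate": (0, 3),
    "Polymers": (0, 3),
    "Borate": (0, 2),
    "Ethanol": (0, 3),
    "Propylene glycol": (8, 14),
    "Enzymes": (0.0001, 0.1),
    "Minor ingredients": (0, 5),
}


def range_totals(ranges):
    """Return the sum of all minimums and the sum of all maximums."""
    low = sum(lo for lo, _ in ranges.values())
    high = sum(hi for _, hi in ranges.values())
    return low, high


def is_valid_recipe(recipe, ranges):
    """Hypothetical check: each ingredient within its range, total at most 100%."""
    in_range = all(ranges[name][0] <= recipe.get(name, 0) <= ranges[name][1]
                   for name in ranges)
    return in_range and sum(recipe.values()) <= 100


low, high = range_totals(FORMULATION_5)
print(f"sum of minimums: {low:.4f}%, sum of maximums: {high:.4f}%")
```

Because the minimums total well under 100%, a composition at the lower limits leaves room for water, whereas choosing every upper limit at once would exceed 100% and is rejected by `is_valid_recipe`.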
6) An aqueous structured liquid detergent composition comprising

Linear alkylbenzenesulfonate (calculated as acid)         15 - 21%
Alcohol ethoxylate (e.g. C12-15 alcohol, 7 EO, or         3 - 9%
  C12-15 alcohol, 5 EO)
Soap as fatty acid (e.g. oleic acid)                      3 - 10%
Zeolite (as NaAlSiO4)                                     14 - 22%
Potassium citrate                                         9 - 18%
Borate (as B4O7)                                          0 - 2%
Carboxymethylcellulose                                    0 - 2%
Polymers (e.g. PEG, PVP)                                  0 - 3%
Anchoring polymers such as, e.g., lauryl                  0 - 3%
  methacrylate/acrylic acid copolymer; molar ratio 25:1;
  MW 3800
Glycerol                                                  0 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. dispersants, suds suppressors,    0 - 5%
  perfume, optical brighteners)



7) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

Fatty alcohol sulfate                                     5 - 10%
Ethoxylated fatty acid monoethanolamide                   3 - 9%
Soap as fatty acid                                        0 - 3%
Sodium carbonate (as Na2CO3)                              5 - 10%
Soluble silicate (as Na2O,2SiO2)                          1 - 4%
Zeolite (as NaAlSiO4)                                     20 - 40%
Sodium sulfate (as Na2SO4)                                2 - 8%
Sodium perborate (as NaBO3.H2O)                           12 - 18%
TAED                                                      2 - 7%
Polymers (e.g. maleic/acrylic acid copolymer, PEG)        1 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. optical brightener, suds          0 - 5%
  suppressors, perfume)


8) A detergent composition formulated as a granulate comprising 


Linear alkylbenzenesulfonate (calculated as acid)         8 - 14%
Ethoxylated fatty acid monoethanolamide                   5 - 11%
Soap as fatty acid                                        0 - 3%
Sodium carbonate (as Na2CO3)                              4 - 10%
Soluble silicate (as Na2O,2SiO2)                          1 - 4%
Zeolite (as NaAlSiO4)                                     30 - 50%
Sodium sulfate (as Na2SO4)                                3 - 11%
Sodium citrate (as C6H5Na3O7)                             5 - 12%
Polymers (e.g. PVP, maleic/acrylic acid copolymer, PEG)   1 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. suds suppressors, perfume)        0 - 5%

9) A detergent composition formulated as a granulate comprising

Linear alkylbenzenesulfonate (calculated as acid)         6 - 12%
Nonionic surfactant                                       1 - 4%
Soap as fatty acid                                        2 - 6%
Sodium carbonate (as Na2CO3)                              14 - 22%
Zeolite (as NaAlSiO4)                                     18 - 32%
Sodium sulfate (as Na2SO4)                                5 - 20%
Sodium citrate (as C6H5Na3O7)                             3 - 8%
Sodium perborate (as NaBO3.H2O)                           4 - 9%
Bleach activator (e.g. NOBS or TAED)                      1 - 5%
Carboxymethylcellulose                                    [illegible]
Polymers (e.g. polycarboxylate or PEG)                    1 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. optical brightener, perfume)      0 - 5%

10) An aqueous liquid detergent composition comprising

Linear alkylbenzenesulfonate (calculated as acid)         15 - 23%
Alcohol ethoxysulfate (e.g. C12-15 alcohol, 2-3 EO)       8 - 15%
Alcohol ethoxylate (e.g. C12-15 alcohol, 7 EO, or         [illegible]
  C12-15 alcohol, 5 EO)
Soap as fatty acid (e.g. lauric acid)                     [illegible]
Aminoethanol                                              [illegible]
Sodium citrate                                            [illegible]
Hydrotrope (e.g. sodium toluene sulfonate)                [illegible]
Borate (as B4O7)                                          [illegible]
Carboxymethylcellulose                                    0 - 1%
Ethanol                                                   1 - 3%
Propylene glycol                                          2 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. polymers, dispersants, perfume,   0 - 5%
  optical brighteners)

11) An aqueous liquid detergent composition comprising

Linear alkylbenzenesulfonate (calculated as acid)         20 - 32%
Alcohol ethoxylate (e.g. C12-15 alcohol, 7 EO, or         6 - 12%
  C12-15 alcohol, 5 EO)
Aminoethanol                                              2 - 6%
Citric acid                                               8 - 14%
Borate (as B4O7)                                          1 - 3%
Polymer (e.g. maleic/acrylic acid copolymer, anchoring    0 - 3%
  polymer such as, e.g., lauryl methacrylate/acrylic
  acid copolymer)
Glycerol                                                  3 - 8%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. hydrotropes, dispersants,         0 - 5%
  perfume, optical brighteners)



12) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

Anionic surfactant (linear alkylbenzenesulfonate,         25 - 40%
  alkyl sulfate, alpha-olefinsulfonate, alpha-sulfo
  fatty acid methyl esters, alkanesulfonates, soap)
Nonionic surfactant (e.g. alcohol ethoxylate)             1 - 10%
Sodium carbonate (as Na2CO3)                              8 - 25%
Soluble silicates (as Na2O,2SiO2)                         5 - 15%
Sodium sulfate (as Na2SO4)                                0 - 5%
Zeolite (as NaAlSiO4)                                     15 - 28%
Sodium perborate (as NaBO3.4H2O)                          0 - 20%
Bleach activator (TAED or NOBS)                           0 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. perfume, optical brighteners)     0 - 3%

13) Detergent formulations as described in 1) - 12) wherein all or part of the linear
alkylbenzenesulfonate is replaced by (C12-C18) alkyl sulfate.



14) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

(C12-C18) alkyl sulfate                                   9 - 15%
Alcohol ethoxylate                                        3 - 6%
Polyhydroxy alkyl fatty acid amide                        1 - 5%
Zeolite (as NaAlSiO4)                                     10 - 20%
Layered disilicate (e.g. SK56 from Hoechst)               10 - 20%
Sodium carbonate (as Na2CO3)                              [illegible] - 12%
Soluble silicate (as Na2O,2SiO2)                          0 - 6%
Sodium citrate                                            4 - 8%
Sodium percarbonate                                       13 - 22%
TAED                                                      3 - 8%
Polymers (e.g. polycarboxylates and PVP)                  0 - 5%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. optical brightener, photo         0 - 5%
  bleach, perfume, suds suppressors)



15) A detergent composition formulated as a granulate having a bulk density of at least 600
g/l comprising

(C12-C18) alkyl sulfate                                   4 - 8%
Alcohol ethoxylate                                        11 - 15%
Soap                                                      1 - 4%
Zeolite MAP or zeolite A                                  35 - 45%
Sodium carbonate (as Na2CO3)                              2 - 8%
Soluble silicate (as Na2O,2SiO2)                          0 - 4%
Sodium percarbonate                                       13 - 22%
TAED                                                      1 - [illegible]%
Carboxymethyl cellulose                                   0 - 3%
Polymers (e.g. polycarboxylates and PVP)                  0 - 3%
Enzymes (calculated as pure enzyme protein)               0.0001 - 0.1%
Minor ingredients (e.g. optical brightener,               0 - 3%
  phosphonate, perfume)



16) Detergent formulations as described in 1) - 15) which contain a stabilized or encapsulated 
peracid, either as an additional component or as a substitute for already specified bleach 
systems. 



17) Detergent compositions as described in 1), 3), 7), 9) and 12) wherein perborate is 
replaced by percarbonate. 

18) Detergent compositions as described in 1), 3), 7), 9), 12), 14) and 15) which additionally 
contain a manganese catalyst. The manganese catalyst may, e.g., be one of the compounds 
described in "Efficient manganese catalysts for low-temperature bleaching", Nature 369 , 
1994, pp. 637-639. 

19) Detergent composition formulated as a nonaqueous detergent liquid comprising a liquid 
nonionic surfactant such as, e.g., linear alkoxylated primary alcohol, a builder system (e.g. 
phosphate), enzyme and alkali. The detergent may also comprise anionic surfactant and/or 
a bleach system. 
The α-amylase variant of the invention may be incorporated in concentrations
conventionally employed in detergents. It is at present contemplated that, in the detergent
composition of the invention, the α-amylase may be added in an amount corresponding to
0.00001-1 mg (calculated as pure enzyme protein) of α-amylase per liter of wash liquor.

Dishwashing Composition 

The dishwashing detergent composition comprises a surfactant which may be
anionic, non-ionic, cationic, amphoteric or a mixture of these types. The detergent will
contain 0-90% of non-ionic surfactant such as low- to non-foaming ethoxylated propoxylated
straight-chain alcohols.

The detergent composition may contain detergent builder salts of inorganic 
and/or organic types. The detergent builders may be subdivided into phosphorus-containing 
and non-phosphorus-containing types. The detergent composition usually contains 1-90% of 
detergent builders. 

Examples of phosphorus-containing inorganic alkaline detergent builders, when
present, include the water-soluble salts, especially alkali metal pyrophosphates,
orthophosphates, and polyphosphates. An example of a phosphorus-containing organic alkaline
detergent builder, when present, is the water-soluble salts of phosphonates. Examples
of non-phosphorus-containing inorganic builders, when present, include water-soluble alkali
metal carbonates, borates and silicates as well as the various types of water-insoluble
crystalline or amorphous aluminosilicates, of which zeolites are the best-known
representatives.

Examples of suitable organic builders include the alkali metal, ammonium and
substituted ammonium citrates, succinates, malonates, fatty acid sulphonates,
carboxymethoxysuccinates, ammonium polyacetates, carboxylates, polycarboxylates,
aminopolycarboxylates, polyacetyl carboxylates and polyhydroxysulphonates.

Other suitable organic builders include the higher molecular weight polymers
and co-polymers known to have builder properties, for example appropriate polyacrylic acid,
polymaleic and polyacrylic/polymaleic acid copolymers and their salts.

The dishwashing detergent composition may contain bleaching agents of the 
chlorine/bromine-type or the oxygen-type. Examples of inorganic chlorine/bromine-type 
bleaches are lithium, sodium or calcium hypochlorite and hypobromite as well as chlorinated 
trisodium phosphate. Examples of organic chlorine/bromine-type bleaches are heterocyclic
N-bromo and N-chloro imides such as trichloroisocyanuric, tribromoisocyanuric, 
dibromoisocyanuric and dichloroisocyanuric acids, and salts thereof with water-solubilizing 
cations such as potassium and sodium. Hydantoin compounds are also suitable. 

The oxygen bleaches are preferred, for example in the form of an inorganic 
persalt, preferably with a bleach precursor or as a peroxy acid compound. Typical examples 
of suitable peroxy bleach compounds are alkali metal perborates, both tetrahydrates and 
monohydrates, alkali metal percarbonates, persilicates and perphosphates. Preferred activator 
materials are TAED and glycerol triacetate. 

The dishwashing detergent composition of the invention may be stabilized 
using conventional stabilizing agents for the enzyme(s), e.g. a polyol such as e.g. propylene 
glycol, a sugar or a sugar alcohol, lactic acid, boric acid, or a boric acid derivative, e.g. an 
aromatic borate ester. 

The dishwashing detergent composition of the invention may also contain other 
conventional detergent ingredients, e.g. deflocculant material, filler material, foam 
depressors, anti-corrosion agents, soil-suspending agents, sequestering agents, anti-soil 
redeposition agents, dehydrating agents, dyes, bactericides, fluorescers, thickeners and 
perfumes. 

Finally, the α-amylase variant of the invention may be used in conventional
dishwashing detergents, e.g. in any of the detergents described in any of the following patent 
publications: 

EP 518719, EP 518720, EP 518721, EP 516553, EP 516554, EP 516555, GB 2200132, DE 
3741617, DE 3727911, DE 4212166, DE 4137470, DE 3833047, WO 93/17089, DE 
4205071, WO 92/09680, WO 93/18129, WO 93/04153, WO 92/06157, WO 92/08777, EP
429124, WO 93/21299, US 5141664, EP 561452, EP 561446, GB 2234980, WO 93/03129, 
EP 481547, EP 530870, EP 533239, EP 554943, EP 346137, US 5112518, EP 318204, EP 
318279, EP 271155, EP 271156, EP 346136, GB 2228945, CA 2006687, WO 93/25651, EP 
530635, EP 414197, US 5240632. 
EXAMPLES
EXAMPLE 1 

Example on Homology building of TERM 

The overall homology of the B. licheniformis α-amylase (in the following
referred to as TERM) to other Termamyl-like α-amylases is high and the percent similarity
is extremely high. The similarity calculated for TERM to BSG (the B. stearothermophilus
α-amylase with SEQ ID NO 6), and BAN (the B. amyloliquefaciens α-amylase with SEQ ID
NO 4) using the University of Wisconsin Genetics Computer Group's program GCG gave
89% and 78%, respectively. TERM has a deletion of 2 residues between residue G180 and
K181 compared to BAN and BSG. BSG has a deletion of 3 residues between G371 and I372
in comparison with BAN and TERM. Further, BSG has a C-terminal extension of more than
20 residues compared to BAN and TERM. BAN has 2 residues fewer and TERM has one
residue fewer at the N-terminus compared to BSG.

The structures of the B. licheniformis α-amylase (TERM) and of the B. amyloliquefaciens
α-amylase (BAN), respectively, were model built on the structure disclosed in Appendix 1
herein. The structure of other Termamyl-like α-amylases (e.g. those disclosed herein) may
be built analogously.

In comparison with the α-amylase used for elucidating the present structure,
TERM differs in that it lacks two residues around 178-182. In order to compensate for this 
in the model structure, the HOMOLOGY program from BIOSYM was used to substitute the 
residues in equivalent positions in the structure (not only structurally conserved regions) 
except for the deletion point. A peptide bond was established between G179(G177) and 
K180(K180) in TERM (BAN). The close structural relationship between the solved structure 
and the model structure (and thus the validity of the latter) is indicated by the presence of 
only very few atoms found to be too close together in the model. 

All waters (605) and ions (4 calcium and 1 sodium) from the solved structure
(Appendix 1) were then added to this very rough structure of TERM, at the same
coordinates as in the solved structure, using the INSIGHT program. This could be done
with only a few overlaps, in other words with a very good fit. This model structure was then
minimized using 200 steps of steepest descent and 600 steps of conjugate gradient (see
Brooks et al. 1983, J. Computational Chemistry 4, p. 187-217). The minimized structure was
then subjected to molecular dynamics: 5 ps heating followed by up to 200 ps, but
more than 35 ps, of equilibration. The dynamics were run with the Verlet algorithm, and the
equilibration temperature of 300 K was maintained using Berendsen coupling to a water bath
(Berendsen et al., 1984, J. Chemical Physics 81, p. 3684-3690). Rotations and translations
were removed every picosecond. The potential energy became stable after approximately
35 ps of equilibration. A mean dynamics structure was extracted and can be used for further
analysis.
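The dynamics protocol described above (Verlet integration with Berendsen temperature coupling and periodic removal of net translation) can be illustrated with a minimal sketch. This is not the BIOSYM implementation: it is a toy one-dimensional system of independent harmonic oscillators in reduced units (k_B = 1), and all function names, the time step, and the coupling time `tau` are assumptions chosen for illustration.

```python
import math
import random


def kinetic_temperature(velocities, masses, k_b=1.0):
    """Instantaneous temperature from sum(m v^2) = N k_B T (one dimension)."""
    twice_ke = sum(m * v * v for m, v in zip(masses, velocities))
    return twice_ke / (k_b * len(velocities))


def berendsen_lambda(temp, target, dt, tau):
    """Berendsen velocity-scaling factor for a single time step."""
    return math.sqrt(1.0 + (dt / tau) * (target / temp - 1.0))


def remove_drift(velocities, masses):
    """Subtract the centre-of-mass velocity (net translation)."""
    v_com = sum(m * v for m, v in zip(masses, velocities)) / sum(masses)
    return [v - v_com for v in velocities]


def velocity_verlet_step(x, v, masses, force, dt):
    """Advance positions and velocities by one velocity-Verlet step."""
    a = [f / m for f, m in zip(force(x), masses)]
    v_half = [vi + 0.5 * ai * dt for vi, ai in zip(v, a)]
    x_new = [xi + vh * dt for xi, vh in zip(x, v_half)]
    a_new = [f / m for f, m in zip(force(x_new), masses)]
    v_new = [vh + 0.5 * ai * dt for vh, ai in zip(v_half, a_new)]
    return x_new, v_new


def run(n_steps=2000, dt=0.01, tau=0.5, target=300.0, seed=0):
    """Equilibrate a toy system towards the target temperature."""
    rng = random.Random(seed)
    n = 50
    masses = [1.0] * n
    x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    v = [rng.gauss(0.0, 1.0) for _ in range(n)]

    def force(pos):
        return [-p for p in pos]  # harmonic restoring force, k = 1

    for step in range(n_steps):
        x, v = velocity_verlet_step(x, v, masses, force, dt)
        lam = berendsen_lambda(kinetic_temperature(v, masses), target, dt, tau)
        v = [lam * vi for vi in v]
        if step % 100 == 0:  # periodic drift removal, as in the text
            v = remove_drift(v, masses)
    return kinetic_temperature(v, masses)


if __name__ == "__main__":
    print(f"kinetic temperature after equilibration: {run():.1f}")
```

The scaling factor equals 1 when the instantaneous temperature matches the target, so the thermostat only nudges the velocities; a real protein simulation would additionally need a force field, three-dimensional coordinates and removal of net rotation.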

EXAMPLE 2 

Determination of residues within 10 Å from the ions present in the solved structure

The coordinates of Appendix 1 are read into the INSIGHT program provided
by BIOSYM Technologies. The spatial coordinates are presented showing the bonds between
the atoms. The ions are presented as well as the water atoms. The subset-creation part
of the program package is used to create a 10 Å subset around the calcium and sodium ions
in the structure using the command ZONE. All residues having an atom within 10 Å are
compiled and written out by the LIST MOLECULE command. By giving the ions the name
"ium" in the coordinate file, a 10 Å sphere around all atoms called "ium" is compiled. The
specific residues identified in this manner are given further above in the section entitled
"Ca2+ dependency".
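The selection performed by the ZONE command can be sketched outside INSIGHT as a simple distance filter. The function name `residues_near_ions` and the coordinates below are invented for illustration; real input would come from the coordinate file of Appendix 1.

```python
import math


def residues_near_ions(atoms, ions, cutoff=10.0):
    """Return residues having at least one atom within `cutoff` Å of any ion.

    atoms: list of (residue_id, (x, y, z)) tuples
    ions:  list of (x, y, z) tuples
    """
    near = set()
    for residue_id, xyz in atoms:
        if any(math.dist(xyz, ion) <= cutoff for ion in ions):
            near.add(residue_id)
    return sorted(near)


# Toy coordinates in Å, invented for illustration only.
atoms = [
    ("A1", (0.0, 0.0, 0.0)),
    ("A1", (1.5, 0.0, 0.0)),
    ("N2", (9.0, 0.0, 0.0)),
    ("L3", (25.0, 0.0, 0.0)),
    ("L3", (30.0, 0.0, 0.0)),
]
ions = [(0.0, 0.0, 0.0)]

print(residues_near_ions(atoms, ions))
```

A residue is reported as soon as any one of its atoms falls inside the sphere, which mirrors the "all residues having an atom within 10 Å" criterion in the text.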

EXAMPLE 3 

Determination of cavities in the solved structure (Appendix 1) 

The solved structure exhibits many internal holes and cavities. When
analyzing for such cavities the Connolly program is normally used (Lee, B. and Richards,
F.M. (1971) J. Mol. Biol. 55, p. 379-400). The program uses a probe of a given radius to
search the external and internal surface of the protein. The smallest hole observable in this
way has the probe radius.

To analyze the solved structure, a modified version of the Connolly program
included in the INSIGHT program was used. First the water molecules and the ions
were removed by unmerging these atoms from the solved structure. By using the command
MOLECULE SURFACE SOLVENT the solvent accessible surface area was calculated for
all atoms and residues using a probe radius of 1.4 Å, and displayed on the graphics screen
together with the model of the solved structure. The internal cavities were then seen as dot
surfaces with no connections to the external surface.
Mutant suggestions for filling out the holes are given in the specification (in
the section entitled "Variants with increased thermostability and/or altered temperature
optimum"). By using the homology-built structures and/or the sequence alignment, mutations
for the homologous structures of TERM, BSG and BAN can be made.

EXAMPLE 4 

Construction of Termamyl™ variants in accordance with the invention 

Termamyl (SEQ ID NO. 2) is expressed in B. subtilis from a plasmid denoted
pDN1528. This plasmid contains the complete gene encoding Termamyl, amyL, the
expression of which is directed by its own promoter. Further, the plasmid contains the
origin of replication, ori, from plasmid pUB110 and the cat gene from plasmid pC194
conferring resistance towards chloramphenicol. pDN1528 is shown in Fig. 9.

A specific mutagenesis vector containing a major part of the coding region of
SEQ ID NO 1 was prepared. The important features of this vector, denoted pJeEN1, include
an origin of replication derived from the pUC plasmids, the cat gene conferring resistance
towards chloramphenicol, and a frameshift-containing version of the bla gene, the wild type
of which normally confers resistance towards ampicillin (ampR phenotype). This mutated
version results in an ampS phenotype. The plasmid pJeEN1 is shown in Fig. 10, and the E.
coli origin of replication, ori, bla, cat, the 5'-truncated version of the Termamyl amylase
gene, and selected restriction sites are indicated on the plasmid.

Mutations are introduced in amyL by the method described by Deng and
Nickoloff (1992, Anal. Biochem. 200, pp. 81-88) except that plasmids with the "selection
primer" (primer #6616; see below) incorporated are selected based on the ampR phenotype
of transformed E. coli cells harboring a plasmid with a repaired bla gene, instead of
employing the selection by restriction enzyme digestion outlined by Deng and Nickoloff.
Chemicals and enzymes used for the mutagenesis were obtained from the Chameleon™
mutagenesis kit from Stratagene (catalogue number 200509).

After verification of the DNA sequence in variant plasmids, the truncated
gene, containing the desired alteration, is subcloned into pDN1528 as a PstI-EcoRI fragment
and transformed into a protease- and amylase-depleted Bacillus subtilis strain in order to
express the variant enzyme.
The Termamyl variant V54W was constructed by the use of the following
mutagenesis primer (written 5' to 3', left to right):
PG GTC GTA GGC ACC GTA GCC CCA ATC CGC TTG

The Termamyl variant A52W + V54W was constructed by the use of the
following mutagenesis primer (written 5' to 3', left to right):
PG GTC GTA GGC ACC GTA GCC CCA ATC CCA TTG GCT CG

Primer #6616 (written 5' to 3', left to right; P denotes a 5' phosphate):
P CTG TGA CTG GTG AGT ACT CAA CCA AGT C
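How the V54W primer encodes the substitution can be checked with a short sketch. It assumes that the leading P is the 5' phosphate (and is therefore dropped), that the primer anneals to the coding strand so its reverse complement reads in the sense direction, and that the wild-type stretch below is the matching segment of the coding sequence printed under SEQ ID No. 1. The helper names are illustrative.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Sense-strand stretch covering Q51-D60, taken from SEQ ID No. 1 as printed.
WILD_TYPE = "CAAGCGGATGTGGGCTACGGTGCTTACGACC"
# V54W mutagenesis primer without the 5' phosphate marker.
V54W_PRIMER = "GGTCGTAGGCACCGTAGCCCCAATCCGCTTG"

mutant = reverse_complement(V54W_PRIMER)
mismatches = [i for i, (w, m) in enumerate(zip(WILD_TYPE, mutant)) if w != m]

print(translate(WILD_TYPE), "->", translate(mutant))
print("mismatch positions (0-based):", mismatches)
```

Under these assumptions the codon GTG (Val) becomes TGG (Trp), and a third mismatch changes GCT to GCC, both of which encode Ala, so that change appears to be silent at the protein level.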

EXAMPLE 5 

Saccharification in the presence of "residual" α-amylase activity

Two appropriate Termamyl variants with altered specificity were evaluated by
saccharifying a DE 10 (DE = dextrose equivalent) maltodextrin substrate with A. niger
glucoamylase and B. acidopullulyticus pullulanase under conditions where the variant amylase
was active.

Saccharification : Substrates for saccharification were prepared by dissolving 230 g DE 10 
spray-dried maltodextrin, prepared from common corn starch, in 460 ml boiling deionized 
water and adjusting the dry substance (DS) content to approximately 30% w/w. The pH was 
adjusted to 4.7 (measured at 60 °C) and aliquots of substrate corresponding to 15 g dry 
weight were transferred to 50 ml blue cap glass flasks. 

The flasks were then placed in a shaking water bath equilibrated at 60 °C, and 
the enzymes added. The pH was readjusted to 4.7 where necessary. 

The following enzymes were used:
Glucoamylase: AMG™ (Novo Nordisk A/S); dosage 0.18 AG/g DS
Pullulanase: Promozyme™ (Novo Nordisk A/S); dosage 0.06 PUN/g DS
α-Amylases: Termamyl™ (Novo Nordisk A/S); dosage 60 NU/g DS
            Termamyl variant V54W; dosage 60 NU/g DS
            Termamyl variant V54W + A52W; dosage 60 NU/g DS
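The dosages above translate into absolute enzyme amounts per flask as sketched below. The calculation assumes each flask holds exactly 15 g dry substance, that the maltodextrin powder counts entirely as dry substance, and that 1 ml of water weighs 1 g; the dictionary keys and function names are illustrative.

```python
import math

FLASK_DRY_SUBSTANCE_G = 15.0

# Dosages per gram of dry substance (DS), as listed in the example.
DOSAGES_PER_G_DS = {
    "glucoamylase_AG": 0.18,
    "pullulanase_PUN": 0.06,
    "alpha_amylase_NU": 60.0,
}


def units_per_flask(dosages, dry_substance_g):
    """Scale per-gram dosages to the dry substance in one flask."""
    return {name: dose * dry_substance_g for name, dose in dosages.items()}


def dry_substance_fraction(solids_g, water_g):
    """Weight fraction of dry substance in a solution."""
    return solids_g / (solids_g + water_g)


per_flask = units_per_flask(DOSAGES_PER_G_DS, FLASK_DRY_SUBSTANCE_G)
initial_ds = dry_substance_fraction(230.0, 460.0)
water_for_30_percent = 230.0 / 0.30 - 230.0

print(per_flask)
print(f"initial DS: {initial_ds:.1%}; water needed for 30% DS: {water_for_30_percent:.0f} g")
```

With these assumptions, 230 g of maltodextrin in 460 ml of water starts at roughly one third dry substance, which is consistent with the subsequent adjustment down to approximately 30% w/w.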

2 ml samples were taken periodically. The pH of each sample was adjusted
to about 3.0, and the sample was then heated in a boiling water bath for 15 minutes to
inactivate the enzymes. After cooling, the samples were treated with approximately 0.1 g
mixed-bed ion exchange resin (Bio-Rad 501-X (D)) for 30 minutes on a rotary mixer and
then filtered. The carbohydrate composition of each sample was determined by HPLC. The
following results were obtained after 72 hours [DPn denotes a dextrose (D-glucose) oligomer
with n glucose units]:



α-amylase          %DP1    %DP2    %DP3    %DP4
None (control)     95.9    2.8     0.4     1.0
V54W               96.0    2.9     0.4     0.8
V54W + A52W        95.9    2.8     0.4     0.8
Termamyl™          95.6    2.8     0.8     0.8



It can be seen from the above results that compared with the control (no
α-amylase activity present during liquefaction), the presence of α-amylase activity from
variants V54W and V54W + A52W did not lead to elevated panose (DP3) levels. In contrast,
Termamyl α-amylase activity resulted in higher levels of panose and a subsequent loss of
D-glucose (DP1) yield.

Thus, if α-amylase variants V54W or V54W + A52W are used for starch
liquefaction, it will not be necessary to inactivate the residual α-amylase activity before the
commencement of saccharification.

EXAMPLE 6 

Calcium-binding affinity of α-amylase variants of the invention

Unfolding of amylases by exposure to heat or to denaturants such as guanidine
hydrochloride is accompanied by a decrease in fluorescence. Loss of calcium ions leads to
unfolding, and the affinity of α-amylases for calcium can be measured by fluorescence
measurements before and after incubation of each α-amylase (e.g. at a concentration of 10
µg/ml) in a buffer (e.g. 50 mM HEPES, pH 7) with different concentrations of calcium (e.g.
in the range of 1 µM-100 mM) or of EGTA (e.g. in the range of 1-1000 µM) [EGTA =
1,2-di(2-aminoethoxy)ethane-N,N,N',N'-tetraacetic acid] for a sufficiently long period of time
(such as 22 hours at 55 °C).
The measured fluorescence F is composed of contributions from the folded and
unfolded forms of the enzyme. The following equation can be derived to describe the
dependence of F on calcium concentration ([Ca]):
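\[
F = \frac{[\mathrm{Ca}]}{K_{diss} + [\mathrm{Ca}]}\,\bigl(\alpha_N - \beta_N \log [\mathrm{Ca}]\bigr)
  + \frac{K_{diss}}{K_{diss} + [\mathrm{Ca}]}\,\bigl(\alpha_U - \beta_U \log [\mathrm{Ca}]\bigr)
\]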

where α_N is the fluorescence of the native (folded) form of the enzyme, β_N is the linear
dependence of α_N on the logarithm of the calcium concentration (as observed experimentally),
α_U is the fluorescence of the unfolded form and β_U is the linear dependence of α_U on the
logarithm of the calcium concentration. K_diss is the apparent calcium-binding constant for an
equilibrium process as follows:

N-Ca ⇌ U + Ca (constant K_diss; N = native enzyme; U = unfolded enzyme)

In fact, unfolding proceeds extremely slowly and is irreversible. The rate of
unfolding is dependent on calcium concentration, and the dependency for a given α-amylase
provides a measure of the Ca-binding affinity of the enzyme. By defining a standard set of
reaction conditions (e.g. 22 hours at 55 °C), a meaningful comparison of K_diss for different
α-amylases can be made. The calcium dissociation curves for α-amylases in general can be
fitted to the equation above, allowing determination of the corresponding values of K_diss.
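The fitting step can be illustrated with a minimal sketch of the fluorescence model and a one-parameter grid search for K_diss. It assumes a base-10 logarithm, treats the four fluorescence parameters as already known, and uses synthetic data generated from the model itself rather than any measured values; the function names are illustrative.

```python
import math


def fluorescence(ca, k_diss, a_n, b_n, a_u, b_u):
    """Model fluorescence F at calcium concentration `ca` (mol/l)."""
    frac_native = ca / (k_diss + ca)
    frac_unfolded = k_diss / (k_diss + ca)
    log_ca = math.log10(ca)  # assumption: "log" in the text means log10
    return (frac_native * (a_n - b_n * log_ca)
            + frac_unfolded * (a_u - b_u * log_ca))


def fit_k_diss(ca_values, f_values, a_n, b_n, a_u, b_u, candidates):
    """Pick the candidate K_diss with the smallest sum of squared residuals."""
    def sse(k):
        return sum((fluorescence(c, k, a_n, b_n, a_u, b_u) - f) ** 2
                   for c, f in zip(ca_values, f_values))
    return min(candidates, key=sse)


# Synthetic illustration only: parameter values are arbitrary.
PARAMS = dict(a_n=100.0, b_n=1.0, a_u=40.0, b_u=0.5)
TRUE_K = 1e-3
ca_values = [10.0 ** e for e in range(-6, 0)]
f_values = [fluorescence(c, TRUE_K, **PARAMS) for c in ca_values]
candidates = [10.0 ** (e / 10.0) for e in range(-60, 1)]

fitted = fit_k_diss(ca_values, f_values, candidates=candidates, **PARAMS)
print(f"fitted K_diss = {fitted:.3g} mol/l")
```

At [Ca] = K_diss the native and unfolded terms contribute equally, which is why K_diss marks the midpoint of the dissociation curve; a real analysis would fit all five parameters simultaneously with a nonlinear least-squares routine.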
The following values for K_diss were obtained for a parent Termamyl-like
α-amylase having the amino acid sequence shown in SEQ ID No. 1 of WO 95/26397 and for
the indicated variant thereof according to the invention:



α-Amylase                          K_diss (mol/l)
L351C + M430C + T183* + G184*      1.7 (±0.5) x 10^-3
Parent                             3.5 (±1.1) x 10^-1



It is apparent from the above that the variant in question binds calcium
significantly more strongly than the parent, and thereby has a correspondingly lower
calcium dependency than the parent.
SEQUENCE LISTING



In the following SEQ ID Nos. 1, 3, 5 the 5' sequence, coding sequence and 3' sequence of the
relevant α-amylase genes are illustrated. The 5' sequence is the first separate part of the
sequence written with lower case letters, the coding sequence is the intermediate part of the
sequence, where the signal sequence is written with lower case letters and the sequence
encoding the mature α-amylase is written with upper case letters, and the 3' sequence is the
third separate part of the sequence written with lower case letters.
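The case convention described above makes it straightforward to separate the signal sequence from the mature coding sequence programmatically. The sketch below is illustrative only: the function name is an assumption, and the excerpt is a short stretch spanning the signal/mature boundary of SEQ ID No. 1 as printed.

```python
def split_coding_sequence(seq):
    """Split a coding sequence into (signal, mature) parts by letter case.

    Lower-case letters are taken as the signal sequence and the first
    upper-case letter as the start of the mature-enzyme coding region.
    Whitespace and line breaks are ignored.
    """
    seq = "".join(seq.split())
    for i, ch in enumerate(seq):
        if ch.isupper():
            return seq[:i], seq[i:]
    return seq, ""


# Short excerpt around the boundary in SEQ ID No. 1.
excerpt = "ctcattctgcagcagcggcgGCAAATCTTAATGGGACG"
signal, mature = split_coding_sequence(excerpt)
print(signal)
print(mature)
```

The mature part of this excerpt begins with GCA AAT CTT AAT GGG ACG, which corresponds to the first residues (ANLNGT) of SEQ ID No. 2.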

SEQ ID No. 1 

cggaagattggaagtacaaaaataagcaaaagattgtcaatcatgtcatgagccatgcgggagacggaaaaatcgtctta
atgcacgatatttatgcaacgttcgcagatgctgctgaagagattattaaaaagctgaaagcaaaaggctatcaattggt
aactgtatctcagcttgaagaagtgaagaagcagagaggctattgaataaatgagtagaagcgccatatcggcgcttttc 
ttttggaagaaaatatagggaaaatggtacttgttaaaaattcggaatatttatacaacatcatatgtttcacattgaaaggggaggagaatc 

atgaaacaacaaaaacggctttacgcccgattgctgacgctgttatttgcgctcatcttcttgctgc 
ctcattctgcagcagcggcgGCAAATCTTAATGGGACGCTGATGCAGTATTTTGAATGGTAC 
ATGCCCAATGACGGCCAA 
CATTGGAGGCGTTTGCAAAACGACTCGGCATATTTGGCTGAACACGGTATTACT 
GCCGTCTGGATTCCCCCGGCATATAA 
GGGAACGAGCCAAGCGGATGTGGGCTACGGTGCTTACGACCTTTATGATTTAGG 
GGAGTTTCATCAAAAAGGGACGGTTC
GGACAAAGTACGGCACAAAAGGAGAGCTGCAATCTGCGATCAAAAGTCTTCATT 
CCCGCGACATTAACGTTTACGGGGAT 
GTGGTCATCAACCACAAAGGCGGCGCTGATGCGACCGAAGATGTAACCGCGGTT 
GAAGTCGATCCCGCTGACCGCAACCG 
CGTAATTTCAGGAGAACACCTAATTAAAGCCTGGACACATTTTCATTTTCCGGG 
GCGCGGCAGCACATACAGCGATTTTA 
AATGGCATTGGTACCATTTTGACGGAACCGATTGGGACGAGTCCCGAAAGCTGA 
ACCGCATCTATAAGTTTCAAGGAAAG 
GCTTGGGATTGGGAAGTTTCCAATGAAAACGGCAACTATGATTATTTGATGTAT 
GCCGACATCGATTATGACCATCCTGA
TGTCGCAGCAGAAATTAAGAGATGGGGCACTTGGTATGCCAATGAACTGCAATT

GGACGGTTTCCGTCTTGATGCTGTCA

AACACATTAAATTTTCTTTTTTGCGGGATTGGGTTAATCATGTCAGGGAAAAAA 

CGGGGAAGGAAATGTTTACGGTAGCT 

GAATATTGGCAGAATGACTTGGGCGCGCTGGAAAACTATTTGAACAAAACAAAT 

TTTAATCATTCAGTGTTTGACGTGCC 

GCTTCATTATCAGTTCCATGCTGCATCGACACAGGGAGGCGGCTATGATATGAG 

GAAATTGCTGAACGGTACGGTCGTTT

CCAAGCATCCGTTGAAATCGGTTACATTTGTCGATAACCATGATACACAGCCGG 

GGCAATCGCTTGAGTCGACTGTCCAA 

ACATGGTTTAAGCCGCTTGCTTACGCTTTTATTCTCACAAGGGAATCTGGATACC 

CTCAGGTTTTCTACGGGGATATGTA 

CGGGACGAAAGGAGACTCCCAGCGCGAAATTCCTGCCTTGAAACACAAAATTG 

AACCGATCTTAAAAGCGAGAAAACAGT 

ATGCGTACGGAGCACAGCATGATTATTTCGACCACCATGACATTGTCGGCTGGA 

CAAGGGAAGGCGACAGCTCGGTTGCA

AATTCAGGTTTGGCGGCATTAATAACAGACGGACCCGGTGGGGCAAAGCGAAT 

GTATGTCGGCCGGCAAAACGCCGGTGA 

GACATGGCATGACATTACCGGAAACCGTTCGGAGCCGGTTGTCATCAATTCGGA 

AGGCTGGGGAGAGTTTCACGTAAACG 

GCGGGTCGGTTTCAATTTATGTTCAAAGATAG 

aagagcagagaggacggatttcctgaaggaaatccgtttttttatttt 

SEQ ID No. 2 

ANLNGTLMQYFEWYMPNDGQHWRRLQNDSAYLAEHGITAV 

WIPPAYKGTSQADVGYGAYDLYDLGEFHQKGTVRTKYGTK 

GELQSAIKSLHSRDINVYGDVVINHKGGADATEDVTAVEV

DPADRNRVISGEHLIKAWTHFHFPGRGSTYSDFKWHWYHF

DGTDWDESRKLNRIYKFQGKAWDWEVSNENGNYDYLMYAD 

IDYDHPDVAAEIKRWGTWYANELQLDGFRLDAVKHIKFSF 

LRDWVNHVREKTGKEMFTVAEYWQNDLGALENYLNKTNFN 

HSVFDVPLHYQFHAASTQGGGYDMRKLLNGTVVSKHPLKS 
VTFVDNHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQ

VFYGDMYGTKGDSQREIPALKHKIEPILKARKQYAYGAQH 

DYFDHHDIVGWTREGDSSVANSGLAALITDGPGGAKRMYV 

GRQNAGETWHDITGNRSEPVVINSEGWGEFHVNGGSVSIY 

VQR 

SEQ ID No. 3 

gccccgcacatacgaaaagactggctgaaaacattgagcctttgatgactgatgatttggctgaagaagtggatcgattg 

tttgagaaaagaagaagaccataaaaataccttgtctgtcatcagacagggtattttttatgctgtccagactgtccgct 

gtgtaaaaataaggaataaaggggggttgttattattttactgatatgtaaaatataatttgtataagaaaatgagaggg agaggaaac 

atgattcaaaaacgaaagcggacagtttcgttcagacttgtgcttatgtgcacgctgttatttgtcagttt 

gccgattacaaaaacatcagccGTAAATGGCACGCTGATGCAGTATTTTGAATGGTATACGC 

CGAACGACGGCCAGCATT 

GGAAACGATTGCAGAATGATGCGGAACATTTATCGGATATCGGAATCACTGCCG 

TCTGGATTCCTCCCGCATACAAAGGA 

TTGAGCCAATCCGATAACGGATACGGACCTTATGATTTGTATGATTTAGGAGAA 

TTCCAGCAAAAAGGGACGGTCAGAAC 

GAAATACGGCACAAAATCAGAGCTTCAAGATGCGATCGGCTCACTGCATTCCCG 

GAACGTCCAAGTATACGGAGATGTGG 

TTTTGAATCATAAGGCTGGTGCTGATGCAACAGAAGATGTAACTGCCGTCGAAG 

TCAATCCGGCCAATAGAAATCAGGAA 

ACTTCGGAGGAATATCAAATCAAAGCGTGGACGGATTTTCGTTTTCCGGGCCGT 

GGAAACACGTACAGTGATTTTAAATG 

GCATTGGTATCATTTCGACGGAGCGGACTGGGATGAATCCCGGAAGATCAGCCG 

CATCTTTA A'<} TTTCGTGGGGAAGGAA 

AAGCGTGGGATTGGGAAGTATCAAGTGAAAACGGCAACTATGACTATTTAATGT 

ATGCTGATGTTGACTACGACCACCCT

GATGTCGTGGCAGAGACAAAAAAATGGGGTATCTGGTATGCGAATGAACTGTCA 

TTAGACGGCTTCCGTATTGATGCCGC 

CAAACATATTAAATTTTCATTTCTGCGTGATTGGGTTCAGGCGGTCAGACAGGC 

GACGGGAAAAGAAATGTTTACGGTTG 

CGGAGTATTGGCAGAATAATGCCGGGAAACTCGAAAACTACTTGAATAAAACA 
AGCTTTAATCAATCCGTGTTTGATGTT
CCGCTTCATTTCAATTTACAGGCGGCTTCCTCACAAGGAGGCGGATATGATATG 
AGGCGTTTGCTGGACGGTACCGTTGT 
GTCCAGGCATCCGGAAAAGGCGGTTACATTTGTTGAAAATCATGACACACAGCC 
GGGACAGTCATTGGAATCGACAGTCC 
AAACTTGGTTTAAACCGCTTGCATACGCCTTTATTTTGACAAGAGAATCCGGTTA 
TCCTCAGGTGTTCTATGGGGATATG 
TACGGGACAAAAGGGACATCGCCAAAGGAAATTCCCTCACTGAAAGATAATAT 
AGAGCCGATTTTAAAAGCGCGTAAGGA 
GTACGCATACGGGCCCCAGCACGATTATATTGACCACCCGGATGTGATCGGATG 
GACGAGGGAAGGTGACAGCTCCGCCG 
CCAAATCAGGTTTGGCCGCTTTAATCACGGACGGACCCGGCGGATCAAAGCGGA 
TGTATGCCGGCCTGAAAAATGCCGGC 
GAGACATGGTATGACATAACGGGCAACCGTTCAGATACTGTAAAAATCGGATCT 
GACGGCTGGGGAGAGTTTCATGTAAA 
CGATGGGTCCGTCTCCATTTATGTTCAGAAATAA 

ggtaataaaaaaacacctccaagctgagtgcgggtatcagcttgga 
ggtgcgtttattttttcagccgtatgacaaggtcggcatcaggtgtgacaaatacggtatgctggctgtcataggtgaca 
aatccgggttttgcgccgtttggctttttcacatgtctgatttttgtataatcaacaggcacggagccggaatctttcgc 
cttggaaaaataagcggcgatcgtagctgcttccaatatggattgttcatcgggatcgctgcttttaatcacaacgtggg atcc 

SEQ ID No. 4 

VNGTLMQYFEWYTPNDGQHWKRLQNDAEHLSDIGITAVWI 

PPAYKGLSQSDNGYGPYDLYDLGEFQQKGTVRTKYGTKSE

LQDAIGSLHSRNVQVYGDVVLNHKAGADATEDVTAVEVNP

ANRNQETSEEYQIKAWTDFRFPGRGNTYSDFKWHWYHFDG

ADWDESRKISRIFKFRGEGKAWDWEVSSENGNYDYLMYAD

VDYDHPDVVAETKKWGIWYANELSLDGFRIDAAKHIKFSF

LRDWVQAVRQATGKEMFTVAEYWQNNAGKLENYLNKTSFN

QSVFDVPLHFNLQAASSQGGGYDMRRLLDGTVVSRHPEKA

VTFVENHDTQPGQSLESTVQTWFKPLAYAFILTRESGYPQ

VFYGDMYGTKGTSPKEIPSLKDNIEPILKARKEYAYGPQH



DYIDHPDVIGWTREGDSSAAKSGLAALITDGPGGSKRMYA 
GLKNAGETWYDITGNRSDTVKIGSDGWGEFHVNDGSVSIY 
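
The agreement between the DNA listing above and the amino acid sequence of SEQ ID No. 4 can be spot-checked by translating a coding fragment. The following is a minimal sketch in Python and is not part of the original listing. It assumes the standard genetic code and a fragment that starts in frame; the name `translate` is illustrative only. The fragment is one 54-nucleotide line copied from the DNA listing above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# One line of the DNA listing (assumed to be in frame from its first base).
fragment = "GATGTCGTGGCAGAGACAAAAAAATGGGGTATCTGGTATGCGAATGAACTGTCA"
peptide = translate(fragment)
print(peptide)  # DVVAETKKWGIWYANELS
```

The translated peptide matches the stretch written `dvvaetkkwgiwyanels` in SEQ ID No. 4.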

SEQ ID No. 5 

aaattcgatattgaaaacgattacaaataaaaattataatagacgtaaacgttcgagggtttgctccctttttactcttt
ttatgcaatcgtttcccttaattttttggaagccaaaccgtcgaatgtaacatttgattaagggggaagggcattgtgct
aacgtttcaccgcatcattcgaaaaggatggatgttcctgctcgcgtttttgctcactgtctcgctgttctgcccaacag
gacagcccgccaaggctGCCGCACCGTTTAACGGCACCATGATGCAGTATTTTGAATGGT
ACTTGCCGGATGATGGCACG
TTATGGACCAAAGTGGCCAATGAAGCCAACAACTTATCCAGCCTTGGCATCACC
GCTCTTTGGCTGCCGCCCGCTTACAA
AGGAACAAGCCGCAGCGACGTAGGGTACGGAGTATACGACTTGTATGACCTCG
GCGAATTCAATCAAAAAGGGACCGTCC
GCACAAAATACGGAACAAAAGCTCAATATCTTCAAGCCATTCAAGCCGCCCACG
CCGCTGGAATGCAAGTGTACGCCGAT
GTCGTGTTCGACCATAAAGGCGGCGCTGACGGCACGGAATGGGTGGACGCCGTC
GAAGTCAATCCGTCCGACCGCAACCA
AGAAATCTCGGGCACCTATCAAATCCAAGCATGGACGAAATTTGATTTTCCCGG
GCGGGGCAACACCTACTCCAGCTTTA
AGTGGCGCTGGTACCATTTTGACGGCGTTGATTGGGACGAAAGCCGAAAATTGA
GCCGCATTTACAAATTCCGCGGCATC
GGCAAAGCGTGGGATTGGGAAGTAGACACGGAAAACGGAAACTATGACTACTT
AATGTATGCCGACCTTGATATGGATCA
TCCCGAAGTCGTGACCGAGCTGAAAAACTGGGGGAAATGGTATGTCAACACAA
CGAACATTGATGGGTTCCGGCTTGATG
CCGTCAAGCATATTAAGTTCAGTTTTTTTCCTGATTGGTTGTCGTATGTGCGTTC
TCAGACTGGCAAGCCGCTATTTACC
GTCGGGGAATATTGGAGCTATGACATCAACAAGTTGCACAATTACATTACGAAA
ACAGACGGAACGATGTCTTTGTTTGA
TGCCCCGTTACACAACAAATTTTATACCGCTTCCAAATCAGGGGGCGCATTTGA
TATGCGCACGTTAATGACCAATACTC
TCATGAAAGATCAACCGACATTGGCCGTCACCTTCGTTGATAATCATGACACCG
AACCCGGCCAAGCGCTGCAGTCATGG

GTCGACCCATGGTTCAAACCGTTGGCTTACGCCTTTATTCTAACTCGGCAGGAA 

GGATACCCGTGCGTCTTTTATGGTGA 

CTATTATGGCATTCCACAATATAACATTCCTTCGCTGAAAAGCAAAATCGATCC 

GCTCCTCATCGCGCGCAGGGATTATG 

CTTACGGAACGCAACATGATTATCTTGATCACTCCGACATCATCGGGTGGACAA 

GGGAAGGGGGCACTGAAAAACCAGGA 

TCCGGACTGGCCGCACTGATCACCGATGGGCCGGGAGGAAGCAAATGGATGTA 

CGTTGGCAAACAACACGCTGGAAAAGT 

GTTCTATGACCTTACCGGCAACCGGAGTGACACCGTCACCATCAACAGTGATGG 

ATGGGGGGAATTCAAAGTCAATGGCG 

GTTCGGTTTCGGTTTGGGTTCCTAGAAAAACGACCGTTTCTACCATCGCTCGGCC 

GATCACAACCCGACCGTGGACTGGT 

GAATTCGTCCGTTGGACCGAACCACGGTTGGTGGCATGGCCTTGA 

tgcctgcga 

SEQ ID No. 6 

AAPFNGTMMQYFEWYLPDDGTLWTKVANEANNLSSLGITA 

LWLPPAYKGTSRSDVGYGVYDLYDLGEFNQKGTVRTKYGT 

KAQYLQAIQAAHAAGMQVYADVVFDHKGGADGTEWVDAVE

VNPSDRNQEISGTYQIQAWTKFDFPGRGNTYSSFKWRWYH 

FDGVDWDESRKLSRIYKFRGIGKAWDWEVDTENGNYDYLM 

YADLDMDHPEVVTELKNWGKWYVNTTNIDGFRLDAVKHIK

FSFFPDWLSYVRSQTGKPLFTVGEYWSYDINKLHNYITKT 

DGTMSLFDAPLHNKFYTASKSGGAFDMRTLMTNTLMKDQP 

TLAVTFVDNHDTEPGQALQSWVDPWFKPLAYAFILTRQEG 

YPCVFYGDYYGIPQYNIPSLKSKIDPLLIARRDYAYGTQH 

DYLDHSDIIGWTREGGTEKPGSGLAALITDGPGGSKWMYV 

GKQHAGKVFYDLTGNRSDTVTINSDGWGEFKVNGGSVSVW 

VPRKTTVSTIARPITTRPWTGEFVRWTEPRLVAW 
SEQ ID No. 10
1 ATPADWRSQS IYFLLTDRFA RTDGSTTATC 
31 NTADQKYCGG TWQGIIDKLD YIQGMGFTAI
61 WITPVTAQLP QTTAYGDAYH GYWQQDIYSL 
91 NENYGTADDL KALSSALHER GMYLMVDVVA
121 NHMGYDGAGS SVDYSVFKPF SSQDYFHPFC 
151 FIQNYEDQTQ VEDCWLGDNT VSLPDLDTTK 
181 DVVKNEWYDW VGSLVSNYSI DGLRIDTVKH 
211 VQKDFWPGYN KAAGVYCIGE VLDGDPAYTC 
241 PYQNVMDGVL NYPIYYPLLN AFKSTSGSMD 
271 DLYNMINTVK SDCPDSTLLG TFVENHDNPR 
301 FASYTNDIAL AKNVAAFIIL NDGIPIIYAG 
331 QEQHYAGGND PANREATWLS GYPTDSELYK 
361 LIASANAIRN YAISKDTGFV TYKNWPIYKD 
391 DITIAMRKGT DGSQIVTILS NKGASGDSYT 
421 LSLSGAGYTA GQQLTEVIGC TTVTVGSDGN 
451 VPVPMAGGLP RVLYPTEKLA GSKICSSS 
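
The claims below refer to fragments of SEQ ID No. 10 by residue number (for example, "amino acid fragment 66-84"). As an illustration only, the following Python sketch shows one way to read a numbered listing of this layout and pull out such a fragment. The function names are hypothetical, and the sketch assumes that the leading number on each line is the 1-based position of the first residue on that line. Only the line starting at position 61 is used here.

```python
def parse_numbered_block(lines):
    """Map 1-based residue positions to residues for lines like '61 WITPV... ...'."""
    residues = {}
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        position = int(fields[0])          # position of the first residue on the line
        for residue in "".join(fields[1:]):
            residues[position] = residue
            position += 1
    return residues


def fragment(residues, start, end):
    """Return residues start..end (inclusive, 1-based) as a string."""
    return "".join(residues[i] for i in range(start, end + 1))


listing = ["61 WITPVTAQLP QTTAYGDAYH GYWQQDIYSL"]
residues = parse_numbered_block(listing)
frag = fragment(residues, 66, 84)
print(frag)  # TAQLPQTTAYGDAYHGYWQ
```

With inclusive numbering, fragment 66-84 is 19 residues long.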
CLAIMS



1. A method of constructing a variant of a parent Termamyl-like a-amylase, which variant 
has a-amylase activity and at least one altered property as compared to said parent a-amylase,
which method comprises

i) analyzing the structure of the parent Termamyl-like a-amylase to identify 
at least one amino acid residue or at least one structural part of the Termamyl-like a-amylase 
structure, which amino acid residue or structural part is believed to be of relevance for 
altering said property of the parent Termamyl-like a-amylase (as evaluated on the basis of 
structural or functional considerations),

ii) constructing a Termamyl-like a-amylase variant, which as compared to the
parent Termamyl-like a-amylase, has been modified in the amino acid residue or structural
part identified in i) so as to alter said property, and

iii) testing the resulting Termamyl-like a-amylase variant for said property.

2. The method according to claim 1, wherein the property to be altered is selected from the 
group consisting of substrate specificity, substrate binding, substrate cleavage pattern,
temperature stability, pH dependent activity, pH dependent stability (especially increased
stability at low (e.g. pH<6) or high (e.g. pH>9) pH values), stability towards oxidation,
Ca2+-dependency and specific activity.

3. The method according to claim 1, wherein the property to be altered is the calcium ion 
dependency and the structural part to be modified is selected from the group consisting of 
the C domain, the interface between the A and B domain, the interface between the A and
C domain, or the interaction to a calcium binding site of the Termamyl-like a-amylase.

4. The method according to claim 1, wherein the property to be altered is the substrate 
cleavage pattern and the structural part to be modified is located within 10 Å from an amino
acid residue of the substrate binding site. 

5. A method of constructing a variant of a parent Termamyl-like a-amylase, which variant
has a-amylase activity and one or more altered properties as compared to said parent a- 
amylase, which method comprises 

i) comparing the three-dimensional structure of the Termamyl-like a-amylase 
with the structure of a non-Termamyl-like a-amylase,

ii) identifying a part of the Termamyl-like a-amylase structure which is 
different from the non-Termamyl-like a-amylase structure and which from structural or 
functional considerations is contemplated to be responsible for differences in one or more 
properties of the Termamyl-like and non-Termamyl-like a-amylase, and 

iii) modifying the part of the Termamyl-like a-amylase identified in ii)
whereby a Termamyl-like a-amylase variant is obtained, one or more properties of which
differ from the parent Termamyl-like a-amylase.

6. The method according to claim 5, wherein, in step iii), the part of the Termamyl-like a-amylase
is modified so as to resemble the corresponding part of the non-Termamyl-like a-amylase.

7. The method according to claim 5, wherein, in step iii), the modification is accomplished
by deleting one or more amino acid residues of the part of the Termamyl-like a-amylase to
be modified; by replacing one or more amino acid residues of the part of the Termamyl-like
a-amylase to be modified with the amino acid residues occupying corresponding positions 
in the non-Termamyl-like a-amylase; or by insertion of one or more amino acid residues 
present in the non-Termamyl-like a-amylase into a corresponding position in the Termamyl- 
like a-amylase. 

8. The method according to claim 5, wherein the non-Termamyl-like a-amylase structure
is the structure of a fungal a-amylase or a mammalian a-amylase. 

9. The method according to claim 8, wherein the non-Termamyl-like a-amylase is the 
Aspergillus oryzae TAKA a-amylase, the A. niger acid a-amylase, the Bacillus subtilis
a-amylase or the pig pancreatic a-amylase.
10. The method according to claim 1, wherein the parent Termamyl-like a-amylase is
derived from a strain of Bacillus.

11. The method according to claim 10, wherein the parent a-amylase is derived from a 
strain of a B. licheniformis, B. amyloliquefaciens, B. stearothermophilus or a strain from an 
alkalophilic Bacillus sp. such as NCIB 12289, NCIB 12512 or NCIB 12513. 

12. The method according to claim 1, wherein the parent a-amylase is a hybrid a-amylase 
comprising a combination of partial amino acid sequences derived from at least two 
a-amylases, of which one is a Termamyl-like a-amylase and the other(s) are, e.g., from a 
microbial and/or a mammalian a-amylase. 

13. The method according to claim 5, wherein the part of the parent Termamyl-like a- 
amylase to be modified and identified in step ii) is loop 1, loop 2, loop 3 and/or loop 8 of 
the parent a-amylase. 

14. A method of constructing a variant of a parent Termamyl-like a-amylase, which has a 
decreased calcium ion dependency as compared to said parent, which method comprises: 

i) identifying an amino acid residue within 10 Å from a Ca2+ binding site of
a Termamyl-like a-amylase in a model of the three-dimensional structure of said a-amylase,
which from structural or functional considerations is believed to be responsible for a non- 
optimal calcium ion interaction, 

ii) constructing a variant wherein said amino acid residue is replaced with 
another amino acid residue which from structural or functional considerations is believed to 
be important for establishing a higher Ca2+ binding affinity, and

iii) testing the Ca2+ dependency of the resulting Termamyl-like a-amylase variant.

15. A method of constructing a variant of a parent Termamyl-like a-amylase which variant 
has a-amylase activity and an altered pH dependent activity, which method comprises 

i) in a three-dimensional structure of the Termamyl-like a-amylase in question, 
identifying an amino acid residue within 15 Å from an active site residue, in particular 10
Å from an active site residue, which amino acid residue is contemplated to be involved in
electrostatic or hydrophobic interactions with an active site residue,

ii) replacing, in the structure, said amino acid residue with an amino acid 
residue which changes the electrostatic and/or hydrophobic surroundings of an active site 
residue and evaluating the accommodation of the amino acid residue in the structure, 

iii) optionally repeating step i) and/or ii) until an amino acid replacement has 
been identified which is accommodated into the structure, 

iv) constructing a Termamyl-like a-amylase variant resulting from steps i), ii) 
and optionally iii) and testing the pH dependent activity of said variant. 

16. A method of increasing the thermostability and/or altering the temperature optimum of 
a parent Termamyl-like a-amylase, which method comprises 

i) identifying an internal hole or a crevice of the parent Termamyl-like a- 
amylase in the three-dimensional structure of said a-amylase, 

ii) replacing, in the structure, one or more amino acid residues in the 
neighborhood of the hole or crevice identified in i) with another amino acid residue which 
from structural or functional considerations is believed to increase the hydrophobic 
interaction and to fill out or reduce the size of the hole or crevice, 

iii) constructing a Termamyl-like a-amylase variant resulting from step ii) and 
testing the thermostability and/or temperature optimum of the variant. 

17. A method of constructing a variant of a Termamyl-like a-amylase which has a reduced 
ability to cleave a substrate close to the branching point, which method comprises 

i) identifying the substrate binding area of the parent Termamyl-like a-amylase 
in a model of the three-dimensional structure of said a-amylase, 

ii) replacing, in the model, one or more amino acid residues of the substrate 
binding area of the cleft identified in i), which is/are believed to be responsible for the 
cleavage pattern of the parent a-amylase, with another amino acid residue which from 
structural considerations is believed to result in an altered substrate cleavage pattern, or 
deleting one or more amino acid residues of the substrate binding area contemplated to 
introduce favorable interactions to the substrate or adding one or more amino acid residues
to the substrate binding area contemplated to introduce favorable interactions to the substrate,
and 

iii) constructing a Termamyl-like a-amylase variant resulting from step ii) and 
testing the substrate cleavage pattern of the variant. 

18. The method according to claim 1, wherein the a-amylase variant is obtained by 
cultivating a microorganism comprising a DNA sequence encoding the variant under 
conditions which are conducive for producing the variant, and optionally subsequently 
recovering the variant from the resulting culture broth. 

19. A variant of a parent Termamyl-like a-amylase, in which variant, at least one amino 
acid residue of the parent a-amylase, which is/are present in a fragment corresponding to the 
amino acid fragment 44-57 of the amino acid sequence of SEQ ID No. 4, has been deleted 
or replaced with one or more amino acid residues which is/are present in a fragment 
corresponding to the amino acid fragment 66-84 of the amino acid sequence shown in SEQ 
ID No. 10, or in which variant, one or more additional amino acid residues has been added 
using the relevant part of SEQ ID No. 10 or a corresponding part of another Fungamyl-like 
a-amylase as a template. 

20. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when 
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to 
residue Y of SEQ ID No 4, the said region having at least 80% sequence homology with the 
part of SEQ ID No 10 extending from residue Z to residue V of SEQ ID No 10, wherein 

X is the amino acid residue occupying position 44, 45, 46, 47 or 48 of SEQ 

ID No. 4, 

Y is the amino acid residue occupying position 51, 52, 53, 54, 55, 56 or 57 
of SEQ ID No. 4, 

Z is the amino acid residue occupying position 66, 67, 68, 69 or 70 of SEQ 
ID No. 10, and 

V is the amino acid residue occupying position 78, 79, 80, 81, 82, 83 or 84 
of SEQ ID No. 10.

21. The variant according to claim 19, wherein X is the amino acid residue occupying
position 48 and Y the amino acid residue occupying position 51 of SEQ ID NO 4 and Z is 
the amino acid residue occupying position 70 and V the amino acid residue occupying 
position 78 in SEQ ID No 10. 

22. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the 
amino acid residues of the parent a-amylase, which is/are present in an amino acid fragment 
corresponding to the amino acid fragment 195-202 of the amino acid sequence of SEQ ID 
No. 4, has been deleted or replaced with one or more of the amino acid residues which is/are 
present in an amino acid fragment corresponding to the amino acid fragment 165-177 of the 
amino acid sequence shown in SEQ ID No. 10, or in which one or more additional amino 
acid residues has been added using the relevant part of SEQ ID No. 10 or a corresponding 
part of another Fungamyl-like a-amylase as a template. 

23. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to 
residue Y of SEQ ID No 4, the said region having at least 80%, such as 90% sequence 
homology with the part of SEQ ID No 10 extending from residue Z to residue V of SEQ ID 
No 10, wherein 

X is the amino acid occupying position 195 or 196 of SEQ ID No. 4, 

Y is the amino acid residue occupying position 198, 199, 200, 201, or 202 of 
SEQ ID No. 4, 

Z is the amino acid residue occupying position 165 or 166 of SEQ ID No. 10, and

V is the amino acid residue occupying position 173, 174, 175, 176 or 177 of
SEQ ID No. 10.

24. The variant according to claim 22, wherein the amino acid fragment of the parent a- 
amylase, which corresponds to amino acid residues 196-198 of SEQ ID No. 4, has been 
replaced with the amino acid fragment corresponding to amino acid residues 166-173 of the 
amino acid sequence shown in SEQ ID No. 10. 
25. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the
amino acid residues of the parent a-amylase, which is/are present in a fragment 
corresponding to the amino acid fragment 117-185 of the amino acid sequence of SEQ ID 
No. 4, has/have been deleted or replaced with one or more of the amino acid residues, which 
is/are present in an amino acid fragment corresponding to the amino acid fragment 98-210 
of the amino acid sequence shown in SEQ ID No. 10, or in which one or more additional 
amino acid residues has been added using the relevant part of SEQ ID No. 10 or a 
corresponding part of another Fungamyl-like a-amylase as a template. 

26. A variant of a parent Termamyl-like a-amylase, which variant has a region which, 
when the amino acid sequence of variant is aligned most closely with the amino acid 
sequence of the said parent a-amylase, occupies the same position as the portion from residue 
X to residue Y of SEQ ID No 4, the said region having at least 80%, such as at least 90% 
sequence homology with the part of SEQ ID No 10 extending from residue Z to residue V 
of SEQ ID No 10, wherein 

X is the amino acid occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 4,

Y is the amino acid occupying position 181, 182, 183, 184 or 185 of SEQ ID No. 4,

Z is the amino acid occupying position 98, 99, 100, 101 or 102 of SEQ ID No. 10, and

V is the amino acid occupying position 206, 207, 208, 209 or 210 of SEQ ID No. 10.

27. The variant according to claim 25, wherein an amino acid fragment of the parent a- 
amylase, which corresponds to amino acid residues 121-181 of SEQ ID No. 4, has been 
replaced with the amino acid fragment corresponding to amino acid residues 102-206 of the 
amino acid sequence shown in SEQ ID No. 10. 

28. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the 
amino acid residues of the parent a-amylase, which is/are present in a fragment 
corresponding to the amino acid fragment 117-181 of the amino acid sequence of SEQ ID 
No. 4, has/have been deleted or replaced with one or more of the amino acid residues, which
is/are present in an amino acid fragment corresponding to the amino acid fragment 98-206
of the amino acid sequence shown in SEQ ID No. 10, or in which one or more additional
amino acid residues has been added using the relevant part of SEQ ID No. 10 or a
corresponding part of another Fungamyl-like a-amylase as a template.

29. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when 
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to
residue Y of SEQ ID No 4, the said region having at least 80%, such as at least 90%
sequence homology with the part of SEQ ID No 10 extending from residue Z to residue V 
of SEQ ID No 10, wherein 

X is the amino acid occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 4,

Y is the amino acid occupying position 174, 175, 176 or 177 of SEQ ID No. 4,

Z is the amino acid occupying position 98, 99, 100, 101 or 102 of SEQ ID No. 10, and

V is the amino acid occupying position 199, 200, 201 or 202 of SEQ ID No. 10.

30. The variant according to claim 28, wherein the amino acid fragment of the parent a- 
amylase, which corresponds to amino acid residues 121-174 of SEQ ID No. 4, has been 
replaced with the amino acid fragment corresponding to amino acid residues 102-199 of the
amino acid sequence shown in SEQ ID No. 10.

31. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the 
amino acid residues of the parent a-amylase, which is/are present in an amino acid fragment 
corresponding to the amino acid fragment 12-19 of the amino acid sequence of SEQ ID No.
4, has/have been deleted or replaced with one or more of the amino acid residues, which
is/are present in an amino acid fragment which corresponds to the amino acid fragment 28-42
of SEQ ID No. 10, or in which one or more additional amino acid residues has/have been
inserted using the relevant part of SEQ ID No. 10 or a corresponding part of another
Fungamyl-like a-amylase as a template. 

32. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to 
residue Y of SEQ ID No 4, the said region having at least 80%, such as at least 90% 
sequence homology with the part of SEQ ID No 10 extending from residue Z to residue V 
of SEQ ID No 10, wherein 

X is the amino acid occupying position 12, 13 or 14 of SEQ ID No. 4,

Y is the amino acid occupying position 15, 16, 17, 18 or 19 of SEQ ID No. 4,

Z is the amino acid occupying position 28, 29, 30, 31 or 32 of SEQ ID No. 10, and

V is an amino acid residue corresponding to the amino acid occupying position
38, 39, 40, 41 or 42 of SEQ ID No. 10.

33. The variant according to claim 31, wherein the amino acid fragment of the parent a- 
amylase, which corresponds to amino acid residues 14-15 of SEQ ID No. 4, has been 
replaced with the amino acid fragment corresponding to amino acid residues 32-38 of the 
amino acid sequence shown in SEQ ID No. 10. 

34. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the 
amino acid residues of the parent a-amylase, which is present in a fragment corresponding 
to amino acid residues 7-23 of the amino acid sequence of SEQ ID No. 4, has/have been 
deleted or replaced with one or more amino acid residues, which is/are present in an amino 
acid fragment corresponding to amino acid residues 13-45 of the amino acid sequence shown 
in SEQ ID No. 10, or in which one or more additional amino acid residues has/have been 
inserted using the relevant part of SEQ ID No. 10 or a corresponding part of another 
Fungamyl-like a-amylase as a template. 
35. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to 
residue Y of SEQ ID No 4, the said region having at least 80%, such as at least 90%
sequence homology with the part of SEQ ID No 10 extending from residue Z to residue V
of SEQ ID No 10, wherein 

X is the amino acid occupying position 7 or 8 of SEQ ID No. 4,

Y is the amino acid occupying position 18, 19, 20, 21, 22 or 23 of SEQ ID No. 4,

Z is the amino acid occupying position 13 or 14 of SEQ ID No. 10, and

V is the amino acid occupying position 40, 41, 42, 43, 44 or 45 of SEQ ID No. 10.

36. The variant according to claim 34, wherein the amino acid fragment of the parent a-amylase,
which corresponds to amino acid residues 8-18 of SEQ ID No. 4, has been replaced
with the amino acid fragment corresponding to amino acid residues 14-40 of the amino acid
sequence shown in SEQ ID No. 10. 

37. A variant of a parent Termamyl-like a-amylase, in which variant, at least one of the
amino acid residues of the parent a-amylase, which is present in a fragment corresponding
to amino acid residues 322-346 of the amino acid sequence of SEQ ID No. 2, has/have been
deleted or replaced with one or more amino acid residues, which is/are present in an amino
acid fragment corresponding to amino acid residues 291-313 of the amino acid sequence
shown in SEQ ID No. 10, or in which one or more additional amino acid residues has/have
been inserted using the relevant part of SEQ ID No. 10 or a corresponding part of another
Fungamyl-like a-amylase as a template.

38. A variant of a parent Termamyl-like a-amylase, which variant has a region which, when 
the amino acid sequence of variant is aligned most closely with the amino acid sequence of
the said parent a-amylase, occupies the same position as the portion from residue X to
residue Y of SEQ ID No 2, the said region having at least 80% sequence homology with the
part of SEQ ID No 10 extending from residue Z to residue V of SEQ ID No 10, wherein 
X is the amino acid occupying position 322, 323, 324 or 325 of SEQ ID No. 2,

Y is the amino acid occupying position 343, 344, 345 or 346 of SEQ ID No. 2,

Z is the amino acid occupying position 291, 292, 293 or 294 of SEQ ID No. 10, and

V is the amino acid occupying position 310, 311, 312 or 313 of SEQ ID No. 10.

39. The variant according to claim 37, wherein the amino acid fragment of the parent a- 
amylase, which corresponds to amino acid residues 325-345 of SEQ ID No. 2, has been
replaced with the amino acid fragment corresponding to amino acid residues 294-313 of the 
amino acid sequence shown in SEQ ID No. 10. 

40. A variant of a parent Fungamyl-like a-amylase, in which variant, at least one of the
amino acid residues of the parent a-amylase, which is/are present in an amino acid fragment 
corresponding to amino acid residues 291-313 of the amino acid sequence of SEQ ID No. 
10, has/have been deleted or replaced with one or more of the amino acid residues, which 
is/are present in an amino acid fragment corresponding to amino acid residues 98-210 of the 
amino acid sequence shown in SEQ ID No. 4, or in which one or more additional amino acid 
residues has/have been inserted using the relevant part of SEQ ID No. 4 or a corresponding 
part of another Termamyl-like a-amylase as a template. 

41. A variant of a parent Fungamyl-like a-amylase, which variant has a region which, when
the amino acid sequence of variant is aligned most closely with the amino acid sequence of 
the said parent a-amylase, occupies the same position as the portion from residue X to 
residue Y of SEQ ID No 10, the said region having at least 80%, such as at least 90% 
sequence homology with the part of SEQ ID No 10 extending from residue Z to residue V 
of SEQ ID No 4, wherein 

X is the amino acid occupying position 117, 118, 119, 120 or 121 of SEQ ID No. 10,

Y is the amino acid occupying position 181, 182, 183, 184 or 185 of SEQ ID No. 10,

Z is the amino acid occupying position 98, 99, 100, 101 or 102 of SEQ ID No. 4, and

V is the amino acid occupying position 206, 207, 208, 209 or 210 of SEQ ID No. 4.

42. The variant according to claim 40, wherein the amino acid fragment of the parent a-amylase,
which corresponds to amino acid residues 121-181 of SEQ ID No. 10, has been
replaced with the amino acid fragment corresponding to amino acid residues 102-206 of the
amino acid sequence shown in SEQ ID No. 4. 

43. A variant according to claim 40, wherein the amino acid fragment of the parent a-amylase,
which corresponds to amino acid residues 121-174 of SEQ ID No. 10, has been
replaced with the amino acid fragment corresponding to amino acid residues 102-199 of the
amino acid sequence shown in SEQ ID No. 4. 

44. A variant of a parent Fungamyl-like a-amylase, wherein an amino acid fragment 
corresponding to amino acid residues 181-184 of the amino acid sequence shown in SEQ ID
No. 10 has been deleted.

45. A variant of a parent Termamyl-like a-amylase, which exhibits a-amylase activity and 
which has a decreased Ca2+ dependency as compared to the parent a-amylase.

46. A variant according to claim 45, which comprises a mutation in a position corresponding
to at least one of the following positions in SEQ ID NO 2:

N104, A349, I479, L346, I430, N457, K385, F350, I411, H408 or G303, in particular a
mutation corresponding to

N104D;

A349C+I479C;

L346C+I430C;

N457D,E;

N457D,E+K385R;

F350D,E+I430R,K;

F350D,E+I411R,K;

H408Q,E,N,D; and/or

G303N,D,Q,E.

47. A variant of a parent Termamyl-like a-amylase which exhibits a higher activity below 
the pH optimum than the parent a-amylase, which variant comprises a mutation of an amino 
acid residue corresponding to at least one of the following positions of the B. licheniformis 
a-amylase (SEQ ID NO 2): E336, Q333, P331, I236, V102, A232, I103, L196, in particular
at least one of the following mutations:

E336R,K;

Q333R,K;

P331R,K;

V102R,K,A,T,S,G;

I236K,R,N;

I103K,R;

L196K,R; and/or

A232T,S,G.

48. A variant of a parent Termamyl-like a-amylase which exhibits a higher activity above 
the pH optimum than the parent a-amylase, which variant comprises a mutation of an amino 
acid residue corresponding to at least one of the following positions of the B. licheniformis 
a-amylase (SEQ ID NO 2): N326, H281 and/or Y273, in particular one of the following
mutations:

N326I,Y,F,L,V;

H281F,I,L; and/or

Y273F,W.

49. A variant of a parent Termamyl-like a-amylase which exhibits a-amylase activity and 
which has an increased thermostability and/or altered temperature optimum as compared to 
the parent a-amylase, which variant comprises a mutation of an amino acid residue 
corresponding to at least one of the following positions of the B. licheniformis a-amylase
(SEQ ID NO 2):

L61, Y62, F67, K106, G145, I212, S151, R214, Y150, F143, R146, L241, I236, L7, V259,
F284, F350, F343, L427 and/or V481, in particular at least one of the following mutations:

L61W,V,F;

Y62W;

F67W;

K106R,F,W;

G145F,W;

I212F,L,W,Y,R,K;

S151 replaced with any other amino acid residue and in particular with F,W,I or L;

R214W;

Y150R,K;

F143W;

R146W;

L241I,F,Y,W;

I236L,F,W,Y;

L7F,I,W;

V259F,I,L;

F284W;

F350W;

F343W;

L427F,L,W; and/or

V481F,I,L,W.

50. A variant of a parent Termamyl-like a-amylase, which exhibits a-amylase activity and 
which has a reduced capability of cleaving an oligo-saccharide substrate close to the 
branching point as compared to the parent a-amylase, which variant comprises a mutation 
of an amino acid residue corresponding to at least one of the following positions of the B.
licheniformis a-amylase (SEQ ID NO 2):

V54, D53, Y56, Q333 and/or G57, in particular at least one of the following mutations: 
V54L,I,F,Y,W,R,K,H,E,Q; 
D53L,I,F,Y,W;
Y56W; 

Q333W; and/or 

G57 to all possible amino acid residues. 

51. The variant according to claim 17, wherein one or more proline residues present in the 
amino acid residues with which the parent a-amylase is modified are replaced with a non- 
proline residue such as alanine. 

52. The variant according to claim 17, wherein one or more cysteine residues present in the 
amino acid residues with which the parent a-amylase is modified are replaced with a non- 
cysteine residue such as alanine. 

53. A DNA construct comprising a DNA sequence encoding an a-amylase variant according 
to claim 17. 

54. A recombinant expression vector which carries a DNA construct according to claim 53. 

55. A cell which is transformed with a DNA construct according to claim 53. 

56. A cell according to claim 55, which is a microorganism. 

57. A cell according to claim 56, which is a bacterium or a fungus. 

58. The cell according to claim 57, which is a gram positive bacterium such as Bacillus
subtilis, Bacillus licheniformis, Bacillus lentus, Bacillus brevis, Bacillus stearothermophilus,
Bacillus alkalophilus, Bacillus amyloliquefaciens, Bacillus coagulans, Bacillus circulans,
Bacillus lautus or Bacillus thuringiensis.

59. Use of an α-amylase variant according to claim 17 for washing and/or dishwashing.

60. Use of an α-amylase variant according to claim 17 for desizing.

61. Use of an α-amylase variant according to claim 17 for starch liquefaction.

62. A detergent additive comprising an α-amylase variant according to claim 17, optionally
in the form of a non-dusting granulate, stabilized liquid or protected enzyme.

63. A detergent additive according to claim 62 which contains 0.02-200 mg of enzyme
protein/g of the additive.

64. A detergent additive according to claim 62, which additionally comprises another
enzyme such as a protease, a lipase, a peroxidase, another amylolytic enzyme and/or a
cellulase.

65. A detergent composition comprising an α-amylase variant according to claim 17.

66. A detergent composition according to claim 65 which additionally comprises another
enzyme such as a protease, a lipase, a peroxidase, another amylolytic enzyme and/or a
cellulase.

67. A manual or automatic dishwashing detergent composition comprising an α-amylase
variant according to claim 17.

68. A dishwashing detergent composition according to claim 67 which additionally
comprises another enzyme such as a protease, a lipase, a peroxidase, another amylolytic
enzyme and/or a cellulase.

69. A manual or automatic laundry washing composition comprising an α-amylase variant
according to claim 17.

70. A laundry washing composition according to claim 69, which additionally comprises
another enzyme such as a protease, a lipase, a peroxidase, an amylolytic enzyme and/or a
cellulase.

ABSTRACT

The present invention relates to a method of constructing a variant of a parent
Termamyl-like α-amylase, which variant has α-amylase activity and at least one altered
property as compared to the parent α-amylase, comprising i) analyzing the structure of the
parent Termamyl-like α-amylase to identify at least one amino acid residue or at least one
structural part of the Termamyl-like α-amylase structure, which amino acid residue or
structural part is believed to be of relevance for altering the property of the parent
Termamyl-like α-amylase (as evaluated on the basis of structural or functional
considerations), ii) constructing a Termamyl-like α-amylase variant, which as compared to
the parent Termamyl-like α-amylase, has been modified in the amino acid residue or
structural part identified in i) so as to alter the property, and iii) testing the resulting
Termamyl-like α-amylase variant for the property in question.

COMBINED DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY
(Includes Reference to PCT International Applications)

Attorney's Docket Number: 4394.204-US



As a below named inventor, I hereby declare that: 



My residence, post office address and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below)
or an original, first and joint inventor (if plural names are listed below) of the 
subject matter which is claimed and for which a patent is sought on the invention 
entitled: 

α-Amylase Mutants



the specification of which (check only one item below):
[ ] is attached hereto
[X] was filed as United States application
    Serial No. to be assigned
    on February 13, 1996
    and was amended on
[ ] was filed as PCT international application
    Number
    on
    and was amended under PCT Article 19 on



I hereby state that I have reviewed and understand the contents of the above-identified
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information which is material to the examination of 
this application in accordance with Title 37, Code of Federal Regulations, §1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, §119 of any
foreign application(s) for patent or inventor's certificate or of any PCT international
application(s) designating at least one country other than the United States of America
listed below and have also identified below any foreign application(s) for patent or
inventor's certificate or any PCT international application(s) designating at least one
country other than the United States of America filed by me on the same subject matter
having a filing date before that of the application(s) of which priority is claimed:



PRIOR FOREIGN/PCT APPLICATION(S) AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. 119:

COUNTRY                    APPLICATION NUMBER   DATE OF FILING       PRIORITY CLAIMED
(if PCT, indicate "PCT")                        (day, month, year)   UNDER 35 USC 119
Denmark                    0128/95              February 3, 1995     [x] YES  [ ] NO
Denmark                    1192/95              October 23, 1995     [x] YES  [ ] NO
Denmark                    1256/95              November 10, 1995    [x] YES  [ ] NO



I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) or PCT
international application(s) designating the United States of America that is/are listed below and, insofar
as the subject matter of each of the claims of this application is not disclosed in that/those prior
application(s) in the manner provided by the first paragraph of Title 35, United States Code, §112,
I acknowledge the duty to disclose material information as defined in Title 37, Code of Federal Regulations,
§1.56, which occurred between the filing date of the prior application(s) and the national or PCT
international filing date of this application:



PRIOR U.S. APPLICATIONS OR PCT INTERNATIONAL APPLICATIONS DESIGNATING THE U.S. FOR BENEFIT

U.S. APPLICATIONS
U.S. APPLICATION NUMBER    U.S. FILING DATE     STATUS (Check one): Patented / Pending / Abandoned

PCT APPLICATIONS DESIGNATING THE U.S.
APPLICATION NO.            FILING DATE          US SERIAL NUMBER ASSIGNED (if any)
PCT/DK96/00057             February 5, 1996

POWER OF ATTORNEY: As a named inventor, I hereby appoint the following attorney(s) and/or agent(s) to prosecute
this application and transact all business in the Patent and Trademark Office connected therewith.

Steve T. Zelson, Reg. No. 30,335
Elias J. Lambiris, Reg. No. 33,728
Cheryl H. Agris, Reg. No. 34,086
Karen A. Lowney, Reg. No. 31,274
James J. Harrington, Reg. No. 38,711

Send Correspondence to:

Steve T. Zelson, Esq.
Novo Nordisk of North America, Inc.
405 Lexington Avenue, Suite 6400
New York, New York 10174-6401

Direct Telephone Calls To:

Steve T. Zelson
(212) 867-0123



Full Name of Inventor
    Family Name: Svendsen
    First Given Name: Allan
    Second Given Name:
Residence & Citizenship
    City: DK-3460 Birkeroed
    State or Foreign Country: Denmark
    Country of Citizenship: Denmark
Post Office Address
    Post Office Address: Bakkeleddet 28
    City: DK-3460 Birkeroed
    State & Zip Code/Country: Denmark

Full Name of Inventor
    Family Name: Bisgard-Frantzen
    First Given Name: Henrik
    Second Given Name:
Residence & Citizenship
    City: DK-2800 Lyngby
    State or Foreign Country: Denmark
    Country of Citizenship: Denmark
Post Office Address
    Post Office Address: Sandkrogen 27
    City: DK-2800 Lyngby
    State & Zip Code/Country: Denmark

Full Name of Inventor
    Family Name: Borchert
    First Given Name: Torben
    Second Given Name: Vedel
Residence & Citizenship
    City: DK-2200 Copenhagen N
    State or Foreign Country: Denmark
    Country of Citizenship: Denmark
Post Office Address
    Post Office Address: Dagmarsgade 34, lejl. 17
    City: DK-2200 Copenhagen N
    State & Zip Code/Country: Denmark



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge 
that willful false statements and the like so made are punishable by fine or imprisonment, or both, under 
section 1001 of Title 18 of the United States Code, and that such willful false statements may jeopardize the
validity of the application or any patent issuing thereon.

Attorney Docket No.: 4394.214-US                                  PATENT

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE

In re Application of: Svendsen et al.
Serial No.: To be assigned              Group Art Unit: To be assigned
Filed: June 3, 1999                     Examiner: To be assigned
For: α-Amylase Mutants

REQUEST TRANSFER OF COMPUTER READABLE
SEQUENCE LISTING FROM PARENT CASE



Assistant Commissioner for Patents 
Washington, DC 20231 



Sir: 

The computer-readable form in this application is identical with that filed in
Application Serial No. 08/600,908 filed February 13, 1996. In accordance with 37 CFR
1.821(e), please use the last filed computer readable form filed in that application as the
computer readable form for the instant application. It is understood that the Patent and
Trademark Office will make the necessary change in application number and filing date for
the computer readable form that will be used for the instant application. A paper copy of the
Sequence Listing is included in a separately filed preliminary amendment for incorporation
into the specification.

The Examiner is hereby invited to contact the undersigned if there are any questions
concerning this response.

Respectfully submitted, 



Date: June 3, 1999 




Reza Green, Reg. No. 38,475
Novo Nordisk of North America, Inc. 
405 Lexington Avenue, Suite 6400 
New York, NY 10174-6401 
(212) 867-0123 



SEQUENCE LISTING

(1) GENERAL INFORMATION:
    (i) APPLICANT: Svendsen, Allan
                   Bisgard-Frantzen, Henrik
                   Borchert, Torben Vedel
    (ii) TITLE OF INVENTION: α-Amylase Mutants
    (iii) NUMBER OF SEQUENCES: 13
    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Novo Nordisk of North America, Inc.
        (B) STREET: 405 Lexington Avenue, 64th Floor
        (C) CITY: New York
        (D) STATE: New York
        (E) COUNTRY: United States of America
        (F) ZIP: 10174-6401
    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.30
    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER:
        (B) FILING DATE: 03-JUN-1999
        (C) CLASSIFICATION:
    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Green, Reza
        (B) REGISTRATION NUMBER: 38,475
        (C) REFERENCE/DOCKET NUMBER: 4394.214-US
    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: 212-867-0123
        (B) TELEFAX: 212-878-9655



(2) INFORMATION FOR SEQ ID NO: 1:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1920 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 334..1869
    (ix) FEATURE:
        (A) NAME/KEY: sig_peptide
        (B) LOCATION: 334..420
    (ix) FEATURE:
        (A) NAME/KEY: mat_peptide
        (B) LOCATION: 421..1869
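
The feature locations above fix the codon counts of the coding region. As a quick arithmetic check (a sketch only; the helper name `codon_count` is hypothetical), the CDS at 334..1869 spans 512 codons, split into 29 signal-peptide codons and 483 mature-peptide codons, which agrees with the 512-amino-acid length stated for SEQ ID NO: 2 and its numbering from -29.

```python
def codon_count(start, end):
    """Number of codons in an inclusive, 1-based nucleotide range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError(f"range {start}..{end} is not a whole number of codons")
    return length // 3


# Feature locations as given for SEQ ID NO: 1.
cds = codon_count(334, 1869)
signal_peptide = codon_count(334, 420)
mature_peptide = codon_count(421, 1869)

# The signal and mature peptides together account for the whole CDS.
assert signal_peptide + mature_peptide == cds

print(cds, signal_peptide, mature_peptide)  # prints: 512 29 483
```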

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CGGAAGATTG GAAGTACAAA AATAAGCAAA AGATTGTCAA TCATGTCATG AGCCATGCGG 60
GAGACGGAAA AATCGTCTTA ATGCACGATA TTTATGCAAC GTTCGCAGAT GCTGCTGAAG 120
AGATTATTAA AAAGCTGAAA GCAAAAGGCT ATCAATTGGT AACTGTATCT CAGCTTGAAG 180
AAGTGAAGAA GCAGAGAGGC TATTGAATAA ATGAGTAGAA GCGCCATATC GGCGCTTTTC 240
TTTTGGAAGA AAATATAGGG AAAATGGTAC TTGTTAAAAA TTCGGAATAT TTATACAACA 300

TCATATGTTT CACATTGAAA GGGGAGGAGA ATC ATG AAA CAA CAA AAA CGG CTT 354
Met Lys Gln Gln Lys Arg Leu
-29 -25

TAC GCC CGA TTG CTG ACG CTG TTA TTT GCG CTC ATC TTC TTG CTG CCT 402
Tyr Ala Arg Leu Leu Thr Leu Leu Phe Ala Leu Ile Phe Leu Leu Pro
-20 -15 -10

CAT TCT GCA GCA GCG GCG GCA AAT CTT AAT GGG ACG CTG ATG CAG TAT 450
His Ser Ala Ala Ala Ala Ala Asn Leu Asn Gly Thr Leu Met Gln Tyr
-5 1 5 10

TTT GAA TGG TAC ATG CCC AAT GAC GGC CAA CAT TGG AGG CGT TTG CAA 498
Phe Glu Trp Tyr Met Pro Asn Asp Gly Gln His Trp Arg Arg Leu Gln
15 20 25

AAC GAC TCG GCA TAT TTG GCT GAA CAC GGT ATT ACT GCC GTC TGG ATT 546
Asn Asp Ser Ala Tyr Leu Ala Glu His Gly Ile Thr Ala Val Trp Ile
30 35 40

CCC CCG GCA TAT AAG GGA ACG AGC CAA GCG GAT GTG GGC TAC GGT GCT 594
Pro Pro Ala Tyr Lys Gly Thr Ser Gln Ala Asp Val Gly Tyr Gly Ala
45 50 55

TAC GAC CTT TAT GAT TTA GGG GAG TTT CAT CAA AAA GGG ACG GTT CGG 642
Tyr Asp Leu Tyr Asp Leu Gly Glu Phe His Gln Lys Gly Thr Val Arg
60 65 70

ACA AAG TAC GGC ACA AAA GGA GAG CTG CAA TCT GCG ATC AAA AGT CTT 690
Thr Lys Tyr Gly Thr Lys Gly Glu Leu Gln Ser Ala Ile Lys Ser Leu
75 80 85 90

CAT TCC CGC GAC ATT AAC GTT TAC GGG GAT GTG GTC ATC AAC CAC AAA 738
His Ser Arg Asp Ile Asn Val Tyr Gly Asp Val Val Ile Asn His Lys
95 100 105

GGC GGC GCT GAT GCG ACC GAA GAT GTA ACC GCG GTT GAA GTC GAT CCC 786
Gly Gly Ala Asp Ala Thr Glu Asp Val Thr Ala Val Glu Val Asp Pro
110 115 120

GCT GAC CGC AAC CGC GTA ATT TCA GGA GAA CAC CTA ATT AAA GCC TGG 834
Ala Asp Arg Asn Arg Val Ile Ser Gly Glu His Leu Ile Lys Ala Trp
125 130 135

ACA CAT TTT CAT TTT CCG GGG CGC GGC AGC ACA TAC AGC GAT TTT AAA 882
Thr His Phe His Phe Pro Gly Arg Gly Ser Thr Tyr Ser Asp Phe Lys
140 145 150

TGG CAT TGG TAC CAT TTT GAC GGA ACC GAT TGG GAC GAG TCC CGA AAG 930
Trp His Trp Tyr His Phe Asp Gly Thr Asp Trp Asp Glu Ser Arg Lys
155 160 165 170

CTG AAC CGC ATC TAT AAG TTT CAA GGA AAG GCT TGG GAT TGG GAA GTT 978
Leu Asn Arg Ile Tyr Lys Phe Gln Gly Lys Ala Trp Asp Trp Glu Val
175 180 185

TCC AAT GAA AAC GGC AAC TAT GAT TAT TTG ATG TAT GCC GAC ATC GAT 1026
Ser Asn Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Ile Asp
190 195 200

TAT GAC CAT CCT GAT GTC GCA GCA GAA ATT AAG AGA TGG GGC ACT TGG 1074
Tyr Asp His Pro Asp Val Ala Ala Glu Ile Lys Arg Trp Gly Thr Trp
205 210 215

TAT GCC AAT GAA CTG CAA TTG GAC GGT TTC CGT CTT GAT GCT GTC AAA 1122
Tyr Ala Asn Glu Leu Gln Leu Asp Gly Phe Arg Leu Asp Ala Val Lys
220 225 230

CAC ATT AAA TTT TCT TTT TTG CGG GAT TGG GTT AAT CAT GTC AGG GAA 1170
His Ile Lys Phe Ser Phe Leu Arg Asp Trp Val Asn His Val Arg Glu
235 240 245 250

AAA ACG GGG AAG GAA ATG TTT ACG GTA GCT GAA TAT TGG CAG AAT GAC 1218
Lys Thr Gly Lys Glu Met Phe Thr Val Ala Glu Tyr Trp Gln Asn Asp
255 260 265

TTG GGC GCG CTG GAA AAC TAT TTG AAC AAA ACA AAT TTT AAT CAT TCA 1266
Leu Gly Ala Leu Glu Asn Tyr Leu Asn Lys Thr Asn Phe Asn His Ser
270 275 280

GTG TTT GAC GTG CCG CTT CAT TAT CAG TTC CAT GCT GCA TCG ACA CAG 1314
Val Phe Asp Val Pro Leu His Tyr Gln Phe His Ala Ala Ser Thr Gln
285 290 295

GGA GGC GGC TAT GAT ATG AGG AAA TTG CTG AAC GGT ACG GTC GTT TCC 1362
Gly Gly Gly Tyr Asp Met Arg Lys Leu Leu Asn Gly Thr Val Val Ser
300 305 310

AAG CAT CCG TTG AAA TCG GTT ACA TTT GTC GAT AAC CAT GAT ACA CAG 1410
Lys His Pro Leu Lys Ser Val Thr Phe Val Asp Asn His Asp Thr Gln
315 320 325 330

CCG GGG CAA TCG CTT GAG TCG ACT GTC CAA ACA TGG TTT AAG CCG CTT 1458
Pro Gly Gln Ser Leu Glu Ser Thr Val Gln Thr Trp Phe Lys Pro Leu
335 340 345

GCT TAC GCT TTT ATT CTC ACA AGG GAA TCT GGA TAC CCT CAG GTT TTC 1506
Ala Tyr Ala Phe Ile Leu Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe
350 355 360

TAC GGG GAT ATG TAC GGG ACG AAA GGA GAC TCC CAG CGC GAA ATT CCT 1554
Tyr Gly Asp Met Tyr Gly Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro
365 370 375

GCC TTG AAA CAC AAA ATT GAA CCG ATC TTA AAA GCG AGA AAA CAG TAT 1602
Ala Leu Lys His Lys Ile Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr
380 385 390

GCG TAC GGA GCA CAG CAT GAT TAT TTC GAC CAC CAT GAC ATT GTC GGC 1650
Ala Tyr Gly Ala Gln His Asp Tyr Phe Asp His His Asp Ile Val Gly
395 400 405 410

TGG ACA AGG GAA GGC GAC AGC TCG GTT GCA AAT TCA GGT TTG GCG GCA 1698
Trp Thr Arg Glu Gly Asp Ser Ser Val Ala Asn Ser Gly Leu Ala Ala
415 420 425

TTA ATA ACA GAC GGA CCC GGT GGG GCA AAG CGA ATG TAT GTC GGC CGG 1746
Leu Ile Thr Asp Gly Pro Gly Gly Ala Lys Arg Met Tyr Val Gly Arg
430 435 440

CAA AAC GCC GGT GAG ACA TGG CAT GAC ATT ACC GGA AAC CGT TCG GAG 1794
Gln Asn Ala Gly Glu Thr Trp His Asp Ile Thr Gly Asn Arg Ser Glu
445 450 455

CCG GTT GTC ATC AAT TCG GAA GGC TGG GGA GAG TTT CAC GTA AAC GGC 1842
Pro Val Val Ile Asn Ser Glu Gly Trp Gly Glu Phe His Val Asn Gly
460 465 470

GGG TCG GTT TCA ATT TAT GTT CAA AGA TAGAAGAGCA GAGAGGACGG 1889
Gly Ser Val Ser Ile Tyr Val Gln Arg
475 480

ATTTCCTGAA GGAAATCCGT TTTTTTATTT T 1920

(2) INFORMATION FOR SEQ ID NO: 2:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 512 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Lys Gln Gln Lys Arg Leu Tyr Ala Arg Leu Leu Thr Leu Leu Phe
-29 -25 -20 -15

Ala Leu Ile Phe Leu Leu Pro His Ser Ala Ala Ala Ala Ala Asn Leu
-10 -5 1

Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly
5 10 15

Gln His Trp Arg Arg Leu Gln Asn Asp Ser Ala Tyr Leu Ala Glu His
20 25 30 35

Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Thr Ser Gln
40 45 50

Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe
55 60 65

His Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Gly Glu Leu
70 75 80

Gln Ser Ala Ile Lys Ser Leu His Ser Arg Asp Ile Asn Val Tyr Gly
85 90 95

Asp Val Val Ile Asn His Lys Gly Gly Ala Asp Ala Thr Glu Asp Val
100 105 110 115

Thr Ala Val Glu Val Asp Pro Ala Asp Arg Asn Arg Val Ile Ser Gly
120 125 130

Glu His Leu Ile Lys Ala Trp Thr His Phe His Phe Pro Gly Arg Gly
135 140 145

Ser Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Thr
150 155 160

Asp Trp Asp Glu Ser Arg Lys Leu Asn Arg Ile Tyr Lys Phe Gln Gly
165 170 175

Lys Ala Trp Asp Trp Glu Val Ser Asn Glu Asn Gly Asn Tyr Asp Tyr
180 185 190 195

Leu Met Tyr Ala Asp Ile Asp Tyr Asp His Pro Asp Val Ala Ala Glu
200 205 210

Ile Lys Arg Trp Gly Thr Trp Tyr Ala Asn Glu Leu Gln Leu Asp Gly
215 220 225

Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe Leu Arg Asp
230 235 240

Trp Val Asn His Val Arg Glu Lys Thr Gly Lys Glu Met Phe Thr Val
245 250 255

Ala Glu Tyr Trp Gln Asn Asp Leu Gly Ala Leu Glu Asn Tyr Leu Asn
260 265 270 275

Lys Thr Asn Phe Asn His Ser Val Phe Asp Val Pro Leu His Tyr Gln
280 285 290

Phe His Ala Ala Ser Thr Gln Gly Gly Gly Tyr Asp Met Arg Lys Leu
295 300 305

Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser Val Thr Phe
310 315 320

Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu Ser Thr Val
325 330 335

Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr Arg Glu
340 345 350 355

Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr Lys Gly
360 365 370

Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile Glu Pro Ile
375 380 385

Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His Asp Tyr Phe
390 395 400

Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp Ser Ser Val
405 410 415

Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly Gly Ala
420 425 430 435

Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr Trp His Asp
440 445 450

Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser Glu Gly Trp
455 460 465

Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr Val Gln Arg
470 475 480
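
The residue labels used in the claims (for example V54 or L7) refer to the mature-peptide numbering of SEQ ID NO: 2, where position 1 is the first residue after the 29-residue signal peptide. The sketch below checks a few of those labels against the first 67 mature residues transcribed from the listing above; the names `MATURE_START`, `residue_at` and `label_matches` are hypothetical helpers, not part of the application.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Mature residues 1..67 of SEQ ID NO: 2, copied from the listing
# (the signal peptide, positions -29..-1, is left out).
MATURE_START = """
Ala Asn Leu
Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Met Pro Asn Asp Gly
Gln His Trp Arg Arg Leu Gln Asn Asp Ser Ala Tyr Leu Ala Glu His
Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Thr Ser Gln
Ala Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr Asp Leu Gly Glu Phe
"""

residues = [THREE_TO_ONE[code] for code in MATURE_START.split()]


def residue_at(position):
    """One-letter residue at a 1-based mature-peptide position."""
    return residues[position - 1]


def label_matches(label):
    """True if a label such as 'V54' agrees with the transcribed sequence."""
    return residue_at(int(label[1:])) == label[0]


if __name__ == "__main__":
    for label in ["L7", "D53", "V54", "Y56", "G57", "L61", "Y62", "F67"]:
        print(label, label_matches(label))
```

Positions beyond 67 (for example Q333 or V481) are outside the excerpt and would need the full sequence to be transcribed in the same way.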



(2) INFORMATION FOR SEQ ID NO: 3:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 2084 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 250..1791
    (ix) FEATURE:
        (A) NAME/KEY: sig_peptide
        (B) LOCATION: 250..342
    (ix) FEATURE:
        (A) NAME/KEY: mat_peptide
        (B) LOCATION: 343..1791
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GCCCCGCACA TACGAAAAGA CTGGCTGAAA ACATTGAGCC TTTGATGACT GATGATTTGG 60
CTGAAGAAGT GGATCGATTG TTTGAGAAAA GAAGAAGACC ATAAAAATAC CTTGTCTGTC 120
ATCAGACAGG GTATTTTTTA TGCTGTCCAG ACTGTCCGCT GTGTAAAAAT AAGGAATAAA 180
GGGGGGTTGT TATTATTTTA CTGATATGTA AAATATAATT TGTATAAGAA AATGAGAGGG 240

AGAGGAAAC ATG ATT CAA AAA CGA AAG CGG ACA GTT TCG TTC AGA CTT 288
Met Ile Gln Lys Arg Lys Arg Thr Val Ser Phe Arg Leu
-31 -30 -25 -20

GTG CTT ATG TGC ACG CTG TTA TTT GTC AGT TTG CCG ATT ACA AAA ACA 336
Val Leu Met Cys Thr Leu Leu Phe Val Ser Leu Pro Ile Thr Lys Thr
-15 -10 -5

TCA GCC GTA AAT GGC ACG CTG ATG CAG TAT TTT GAA TGG TAT ACG CCG 384
Ser Ala Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro
1 5 10

AAC GAC GGC CAG CAT TGG AAA CGA TTG CAG AAT GAT GCG GAA CAT TTA 432
Asn Asp Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu
15 20 25 30

TCG GAT ATC GGA ATC ACT GCC GTC TGG ATT CCT CCC GCA TAC AAA GGA 480
Ser Asp Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly
35 40 45

TTG AGC CAA TCC GAT AAC GGA TAC GGA CCT TAT GAT TTG TAT GAT TTA 528
Leu Ser Gln Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu
50 55 60

GGA GAA TTC CAG CAA AAA GGG ACG GTC AGA ACG AAA TAC GGC ACA AAA 576
Gly Glu Phe Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys
65 70 75

TCA GAG CTT CAA GAT GCG ATC GGC TCA CTG CAT TCC CGG AAC GTC CAA 624
Ser Glu Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln
80 85 90

GTA TAC GGA GAT GTG GTT TTG AAT CAT AAG GCT GGT GCT GAT GCA ACA 672
Val Tyr Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr
95 100 105 110

GAA GAT GTA ACT GCC GTC GAA GTC AAT CCG GCC AAT AGA AAT CAG GAA 720
Glu Asp Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gln Glu
115 120 125

ACT TCG GAG GAA TAT CAA ATC AAA GCG TGG ACG GAT TTT CGT TTT CCG 768
Thr Ser Glu Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe Arg Phe Pro
130 135 140

GGC CGT GGA AAC ACG TAC AGT GAT TTT AAA TGG CAT TGG TAT CAT TTC 816
Gly Arg Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe
145 150 155

GAC GGA GCG GAC TGG GAT GAA TCC CGG AAG ATC AGC CGC ATC TTT AAG 864
Asp Gly Ala Asp Trp Asp Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys
160 165 170

TTT CGT GGG GAA GGA AAA GCG TGG GAT TGG GAA GTA TCA AGT GAA AAC 912
Phe Arg Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn
175 180 185 190

GGC AAC TAT GAC TAT TTA ATG TAT GCT GAT GTT GAC TAC GAC CAC CCT 960
Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro
195 200 205

GAT GTC GTG GCA GAG ACA AAA AAA TGG GGT ATC TGG TAT GCG AAT GAA 1008
Asp Val Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu
210 215 220

CTG TCA TTA GAC GGC TTC CGT ATT GAT GCC GCC AAA CAT ATT AAA TTT 1056
Leu Ser Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys Phe
225 230 235

TCA TTT CTG CGT GAT TGG GTT CAG GCG GTC AGA CAG GCG ACG GGA AAA 1104
Ser Phe Leu Arg Asp Trp Val Gln Ala Val Arg Gln Ala Thr Gly Lys
240 245 250

GAA ATG TTT ACG GTT GCG GAG TAT TGG CAG AAT AAT GCC GGG AAA CTC 1152
Glu Met Phe Thr Val Ala Glu Tyr Trp Gln Asn Asn Ala Gly Lys Leu
255 260 265 270

GAA AAC TAC TTG AAT AAA ACA AGC TTT AAT CAA TCC GTG TTT GAT GTT 1200
Glu Asn Tyr Leu Asn Lys Thr Ser Phe Asn Gln Ser Val Phe Asp Val
275 280 285

CCG CTT CAT TTC AAT TTA CAG GCG GCT TCC TCA CAA GGA GGC GGA TAT 1248
Pro Leu His Phe Asn Leu Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr
290 295 300

GAT ATG AGG CGT TTG CTG GAC GGT ACC GTT GTG TCC AGG CAT CCG GAA 1296
Asp Met Arg Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu
305 310 315

AAG GCG GTT ACA TTT GTT GAA AAT CAT GAC ACA CAG CCG GGA CAG TCA 1344
Lys Ala Val Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser
320 325 330

TTG GAA TCG ACA GTC CAA ACT TGG TTT AAA CCG CTT GCA TAC GCC TTT 1392
Leu Glu Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe
335 340 345 350

ATT TTG ACA AGA GAA TCC GGT TAT CCT CAG GTG TTC TAT GGG GAT ATG 1440
Ile Leu Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met
355 360 365

TAC GGG ACA AAA GGG ACA TCG CCA AAG GAA ATT CCC TCA CTG AAA GAT 1488
Tyr Gly Thr Lys Gly Thr Ser Pro Lys Glu Ile Pro Ser Leu Lys Asp
370 375 380

AAT ATA GAG CCG ATT TTA AAA GCG CGT AAG GAG TAC GCA TAC GGG CCC 1536
Asn Ile Glu Pro Ile Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro
385 390 395

CAG CAC GAT TAT ATT GAC CAC CCG GAT GTG ATC GGA TGG ACG AGG GAA 1584
Gln His Asp Tyr Ile Asp His Pro Asp Val Ile Gly Trp Thr Arg Glu
400 405 410

GGT GAC AGC TCC GCC GCC AAA TCA GGT TTG GCC GCT TTA ATC ACG GAC 1632
Gly Asp Ser Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp
415 420 425 430

GGA CCC GGC GGA TCA AAG CGG ATG TAT GCC GGC CTG AAA AAT GCC GGC 1680
Gly Pro Gly Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly
435 440 445

GAG ACA TGG TAT GAC ATA ACG GGC AAC CGT TCA GAT ACT GTA AAA ATC 1728
Glu Thr Trp Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr Val Lys Ile
450 455 460

GGA TCT GAC GGC TGG GGA GAG TTT CAT GTA AAC GAT GGG TCC GTC TCC 1776
Gly Ser Asp Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser
465 470 475

ATT TAT GTT CAG AAA TAAGGTAATA AAAAAACACC TCCAAGCTGA GTGCGGGTAT 1831
Ile Tyr Val Gln Lys
480

CAGCTTGGAG GTGCGTTTAT TTTTTCAGCC GTATGACAAG GTCGGCATCA GGTGTGACAA 1891
ATACGGTATG CTGGCTGTCA TAGGTGACAA ATCCGGGTTT TGCGCCGTTT GGCTTTTTCA 1951
CATGTCTGAT TTTTGTATAA TCAACAGGCA CGGAGCCGGA ATCTTTCGCC TTGGAAAAAT 2011
AAGCGGCGAT CGTAGCTGCT TCCAATATGG ATTGTTCATC GGGATCGCTG CTTTTAATCA 2071
CAACGTGGGA TCC 2084



(2) INFORMATION FOR SEQ ID NO: 4:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 514 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: protein
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ile Gln Lys Arg Lys Arg Thr Val Ser Phe Arg Leu Val Leu Met
-31 -30 -25 -20

Cys Thr Leu Leu Phe Val Ser Leu Pro Ile Thr Lys Thr Ser Ala Val
-15 -10 -5 1

Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp Gly
5 10 15

Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp Ile
20 25 30

Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser Gln
35 40 45

Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu Phe
50 55 60 65

Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu Leu
70 75 80

Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln Val Tyr Gly
85 90 95

Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp Val
100 105 110

Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gln Glu Thr Ser Glu
115 120 125

Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg Gly
130 135 140 145

Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly Ala
150 155 160

Asp Trp Asp Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg Gly
165 170 175

Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn Tyr
180 185 190

Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val Val
195 200 205

Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu Ser Leu
210 215 220 225

Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys Phe Ser Phe Leu
230 235 240

Arg Asp Trp Val Gln Ala Val Arg Gln Ala Thr Gly Lys Glu Met Phe
245 250 255

Thr Val Ala Glu Tyr Trp Gln Asn Asn Ala Gly Lys Leu Glu Asn Tyr
260 265 270

Leu Asn Lys Thr Ser Phe Asn Gln Ser Val Phe Asp Val Pro Leu His
275 280 285

Phe Asn Leu Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met Arg
290 295 300 305

Arg Leu Leu Asp Gly Thr Val Val Ser Arg His Pro Glu Lys Ala Val
310 315 320

Thr Phe Val Glu Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu Ser
325 330 335

Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr
340 345 350

Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly Thr
355 360 365

Lys Gly Thr Ser Pro Lys Glu Ile Pro Ser Leu Lys Asp Asn Ile Glu
370 375 380 385

Pro Ile Leu Lys Ala Arg Lys Glu Tyr Ala Tyr Gly Pro Gln His Asp
390 395 400

Tyr Ile Asp His Pro Asp Val Ile Gly Trp Thr Arg Glu Gly Asp Ser
405 410 415

Ser Ala Ala Lys Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly
420 425 430

Gly Ser Lys Arg Met Tyr Ala Gly Leu Lys Asn Ala Gly Glu Thr Trp
435 440 445

Tyr Asp Ile Thr Gly Asn Arg Ser Asp Thr Val Lys Ile Gly Ser Asp
450 455 460 465

Gly Trp Gly Glu Phe His Val Asn Asp Gly Ser Val Ser Ile Tyr Val
470 475 480

Gln Lys

(2) INFORMATION FOR SEQ ID NO: 5:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 1814 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: DNA (genomic)
    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 156..1802
    (ix) FEATURE:
        (A) NAME/KEY: sig_peptide
        (B) LOCATION: 156..257
    (ix) FEATURE:
        (A) NAME/KEY: mat_peptide
        (B) LOCATION: 258..1802
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AAATTCGATA TTGAAAACGA TTACAAATAA AAATTATAAT AGACGTAAAC GTTCGAGGGT 6 0 

TTGCTCCCTT TTTACTCTTT TTATGCAATC GTTTCCCTTA ATTTTTTGGA AGCCAAACCG 12 0 

TCGAATGTAA CATTTGATTA AGGGGGAAGG GCATT GTG CTA ACG TTT CAC CGC 173 

Val ' Leu Thr Phe His Arg 
-34 -30 

ATC ATT CGA AAA GGA TGG ATG TTC CTG CTC GCG TTT TTG CTC ACT GTC 221 
lie lie Arg Lys Gly Trp Met Phe Leu Leu Ala Phe Leu Leu Thr Val 
-25 -20 -15 

TCG CTG TTC TGC CCA ACA GGA CAG CCC GCC AAG GCT GCC GCA CCG TTT 26 9 

Ser Leu Phe Cys Pro Thr Gly Gin Pro Ala Lys Ala Ala Ala Pro Phe 
-10 -5 1 

AAC GGC ACC ATG ATG CAG TAT TTT GAA TGG TAC TTG CCG GAT GAT GGC 317 
Asn Gly Thr Met Met Gin Tyr Phe Glu Trp Tyr Leu Pro Asp Asp Gly 
5 10 15 20 

ACG TTA TGG ACC AAA GTG GCC AAT GAA GCC AAC AAC TTA TCC AGC CTT 365 
Thr Leu Trp Thr Lys Val Ala Asn Glu Ala Asn Asn Leu Ser Ser Leu 
25 30 35 

GGC ATC ACC GCT CTT TGG CTG CCG CCC GCT TAC AAA GGA ACA AGC CGC 413 
Gly lie Thr Ala Leu Trp Leu Pro Pro Ala Tyr Lys Gly Thr Ser Arg 
40 45 . 50 

AGC GAC GTA GGG TAC GGA GTA TAC GAC TTG TAT GAC CTC GGC GAA TTC 461 
Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu Tyr Asp Leu Gly Glu Phe 
55 60 65 

AAT CAA AAA GGG ACC GTC CGC ACA AAA TAC GGA ACA AAA GCT CAA TAT 509 
Asn Gin Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ala Gin Tyr 
70 75 80 

CTT CAA GCC ATT CAA GCC GCC CAC GCC GCT GGA ATG CAA GTG TAC GCC 557 
Leu Gin Ala lie Gin Ala Ala His Ala Ala Gly Met Gin Val Tyr Ala 
85 90 95 100 

GAT GTC GTG TTC GAC CAT AAA GGC GGC GCT GAC GGC ACG GAA TGG GTG 605 
Asp Val Val Phe Asp His Lys Gly Gly Ala Asp Gly Thr Glu Trp Val 
105 110 lis 
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GAC GCC GTC GAA GTC AAT CCG TCC GAC CGC AAC CAA GAA ATC TCG GGC 653 
Asp Ala Val Glu Val Asn Pro Ser Asp Arg Asn Gin Glu lie Ser Gly 
120 125 130 

ACC TAT CAA ATC CAA GCA TGG ACG AAA TTT GAT TTT CCC GGG CGG GGC 701 
Tbr Tyr Gin lie Gin Ala Trp Thr Lys Phe Asp Phe Pro Gly Arg Gly 
135 140 145 

AAC ACC TAC TCC AGC TTT AAG TGG CGC TGG TAC CAT TTT GAC GGC GTT 74 9 

Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp Tyr His Phe Asp Gly Val 
150 155 160 

GAT TGG GAC GAA AGC CGA AAA TTG AGC CGC ATT TAC AAA TTC CGC GGC 797 
Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg lie Tyr Lys Phe Arg Gly 
165 170 175 180 

ATC GGC AAA GCG TGG GAT TGG GAA GTA GAC ACG GAA AAC GGA AAC TAT 84 5 

lie Gly Lys Ala Trp Asp Trp Glu Val Asp Thr Glu Asn Gly Asn Tyr 
185 190 195 

GAC TAC TTA ATG TAT GCC GAC CTT GAT ATG GAT CAT CCC GAA GTC GTG 893 
Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met Asp His Pro Glu Val Val 
200 205 210 

ACC GAG CTG AAA AAC TGG GGG AAA TGG TAT GTC AAC ACA ACG AAC ATT 941 
Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr Val Asn Thr Thr Asn Ile
215 220 225 

GAT GGG TTC CGG CTT GAT GCC GTC AAG CAT ATT AAG TTC AGT TTT TTT 989 
Asp Gly Phe Arg Leu Asp Ala Val Lys His Ile Lys Phe Ser Phe Phe
230 235 240 

CCT GAT TGG TTG TCG TAT GTG CGT TCT CAG ACT GGC AAG CCG CTA TTT 1037
Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln Thr Gly Lys Pro Leu Phe
245 250 255 260 

ACC GTC GGG GAA TAT TGG AGC TAT GAC ATC AAC AAG TTG CAC AAT TAC 1085
Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile Asn Lys Leu His Asn Tyr
265 270 275 

ATT ACG AAA ACA GAC GGA ACG ATG TCT TTG TTT GAT GCC CCG TTA CAC 1133 
Ile Thr Lys Thr Asp Gly Thr Met Ser Leu Phe Asp Ala Pro Leu His
280 285 290 

AAC AAA TTT TAT ACC GCT TCC AAA TCA GGG GGC GCA TTT GAT ATG CGC 1181 
Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly Gly Ala Phe Asp Met Arg 
295 300 305 

ACG TTA ATG ACC AAT ACT CTC ATG AAA GAT CAA CCG ACA TTG GCC GTC 1229
Thr Leu Met Thr Asn Thr Leu Met Lys Asp Gln Pro Thr Leu Ala Val
310 315 320 

ACC TTC GTT GAT AAT CAT GAC ACC GAA CCC GGC CAA GCG CTG CAG TCA 1277
Thr Phe Val Asp Asn His Asp Thr Glu Pro Gly Gln Ala Leu Gln Ser
325 330 335 340 

TGG GTC GAC CCA TGG TTC AAA CCG TTG GCT TAC GCC TTT ATT CTA ACT 1325
Trp Val Asp Pro Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu Thr
345 350 355 

CGG CAG GAA GGA TAC CCG TGC GTC TTT TAT GGT GAC TAT TAT GGC ATT 1373
Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr Gly Asp Tyr Tyr Gly Ile
360 365 370

CCA CAA TAT AAC ATT CCT TCG CTG AAA AGC AAA ATC GAT CCG CTC CTC 1421
Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser Lys Ile Asp Pro Leu Leu
375 380 385 

ATC GCG CGC AGG GAT TAT GCT TAC GGA ACG CAA CAT GAT TAT CTT GAT 1469
Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr Gln His Asp Tyr Leu Asp
390 395 400 

CAC TCC GAC ATC ATC GGG TGG ACA AGG GAA GGG GGC ACT GAA AAA CCA 1517 
His Ser Asp Ile Ile Gly Trp Thr Arg Glu Gly Gly Thr Glu Lys Pro
405 410 415 420 

GGA TCC GGA CTG GCC GCA CTG ATC ACC GAT GGG CCG GGA GGA AGC AAA 1565 
Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro Gly Gly Ser Lys
425 430 435 

TGG ATG TAC GTT GGC AAA CAA CAC GCT GGA AAA GTG TTC TAT GAC CTT 1613 
Trp Met Tyr Val Gly Lys Gln His Ala Gly Lys Val Phe Tyr Asp Leu
440 445 450 

ACC GGC AAC CGG AGT GAC ACC GTC ACC ATC AAC AGT GAT GGA TGG GGG 1661 
Thr Gly Asn Arg Ser Asp Thr Val Thr Ile Asn Ser Asp Gly Trp Gly
455 460 465 

GAA TTC AAA GTC AAT GGC GGT TCG GTT TCG GTT TGG GTT CCT AGA AAA 1709
Glu Phe Lys Val Asn Gly Gly Ser Val Ser Val Trp Val Pro Arg Lys
470 475 480 

ACG ACC GTT TCT ACC ATC GCT CGG CCG ATC ACA ACC CGA CCG TGG ACT 1757
Thr Thr Val Ser Thr Ile Ala Arg Pro Ile Thr Thr Arg Pro Trp Thr
485 490 495 500 

GGT GAA TTC GTC CGT TGG ACC GAA CCA CGG TTG GTG GCA TGG CCT 1802 
Gly Glu Phe Val Arg Trp Thr Glu Pro Arg Leu Val Ala Trp Pro 
505 510 515 

TGATGCCTGC GA 1814 

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Val Leu Thr Phe His Arg Ile Ile Arg Lys Gly Trp Met Phe Leu Leu
-34 -30 -25 -20 

Ala Phe Leu Leu Thr Val Ser Leu Phe Cys Pro Thr Gly Gln Pro Ala
-15 -10 -5 

Lys Ala Ala Ala Pro Phe Asn Gly Thr Met Met Gln Tyr Phe Glu Trp
1 5 10

Tyr Leu Pro Asp Asp Gly Thr Leu Trp Thr Lys Val Ala Asn Glu Ala 
15 20 25 30 

Asn Asn Leu Ser Ser Leu Gly Ile Thr Ala Leu Trp Leu Pro Pro Ala
35 40 45 

Tyr Lys Gly Thr Ser Arg Ser Asp Val Gly Tyr Gly Val Tyr Asp Leu 
50 55 60

Tyr Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr
65 70 75 

Gly Thr Lys Ala Gln Tyr Leu Gln Ala Ile Gln Ala Ala His Ala Ala
80 85 90 

Gly Met Gln Val Tyr Ala Asp Val Val Phe Asp His Lys Gly Gly Ala
95 100 105 110 

Asp Gly Thr Glu Trp Val Asp Ala Val Glu Val Asn Pro Ser Asp Arg 
115 120 125 

Asn Gln Glu Ile Ser Gly Thr Tyr Gln Ile Gln Ala Trp Thr Lys Phe
130 135 140 

Asp Phe Pro Gly Arg Gly Asn Thr Tyr Ser Ser Phe Lys Trp Arg Trp 
145 150 155 

Tyr His Phe Asp Gly Val Asp Trp Asp Glu Ser Arg Lys Leu Ser Arg 
160 165 170 

Ile Tyr Lys Phe Arg Gly Ile Gly Lys Ala Trp Asp Trp Glu Val Asp
175 180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Leu Asp Met 
195 200 205 

Asp His Pro Glu Val Val Thr Glu Leu Lys Asn Trp Gly Lys Trp Tyr 
210 215 220 

Val Asn Thr Thr Asn Ile Asp Gly Phe Arg Leu Asp Ala Val Lys His
225 230 235 

Ile Lys Phe Ser Phe Phe Pro Asp Trp Leu Ser Tyr Val Arg Ser Gln
240 245 250 

Thr Gly Lys Pro Leu Phe Thr Val Gly Glu Tyr Trp Ser Tyr Asp Ile
255 260 265 270 

Asn Lys Leu His Asn Tyr Ile Thr Lys Thr Asp Gly Thr Met Ser Leu
275 280 285 

Phe Asp Ala Pro Leu His Asn Lys Phe Tyr Thr Ala Ser Lys Ser Gly 
290 295 300 

Gly Ala Phe Asp Met Arg Thr Leu Met Thr Asn Thr Leu Met Lys Asp 
305 310 315 

Gln Pro Thr Leu Ala Val Thr Phe Val Asp Asn His Asp Thr Glu Pro
320 325 330 

Gly Gln Ala Leu Gln Ser Trp Val Asp Pro Trp Phe Lys Pro Leu Ala
335 340 345 350 

Tyr Ala Phe Ile Leu Thr Arg Gln Glu Gly Tyr Pro Cys Val Phe Tyr
355 360 365 

Gly Asp Tyr Tyr Gly Ile Pro Gln Tyr Asn Ile Pro Ser Leu Lys Ser
370 375 380 

Lys Ile Asp Pro Leu Leu Ile Ala Arg Arg Asp Tyr Ala Tyr Gly Thr
385 390 395 

Gln His Asp Tyr Leu Asp His Ser Asp Ile Ile Gly Trp Thr Arg Glu
400 405 410

Gly Gly Thr Glu Lys Pro Gly Ser Gly Leu Ala Ala Leu Ile Thr Asp
415 420 425 430

Gly Pro Gly Gly Ser Lys Trp Met Tyr Val Gly Lys Gln His Ala Gly
435 440 445

Lys Val Phe Tyr Asp Leu Thr Gly Asn Arg Ser Asp Thr Val Thr Ile
450 455 460

Asn Ser Asp Gly Trp Gly Glu Phe Lys Val Asn Gly Gly Ser Val Ser
465 470 475

Val Trp Val Pro Arg Lys Thr Thr Val Ser Thr Ile Ala Arg Pro Ile
480 485 490

Thr Thr Arg Pro Trp Thr Gly Glu Phe Val Arg Trp Thr Glu Pro Arg
495 500 505 510

Leu Val Ala Trp Pro
515


(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
GGTCGTAGGC ACCGTAGCCC CAATCCGCTT G 



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GGTCGTAGGC ACCGTAGCCC CAATCCCATT GGCTCG 



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CTGTGACTGG TGAGTACTCA ACCAAGTC

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 478 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Ala Thr Pro Ala Asp Trp Arg Ser Gln Ser Ile Tyr Phe Leu Leu Thr
1 5 10 15

Asp Arg Phe Ala Arg Thr Asp Gly Ser Thr Thr Ala Thr Cys Asn Thr 
20 25 30 

Ala Asp Gln Lys Tyr Cys Gly Gly Thr Trp Gln Gly Ile Ile Asp Lys
35 40 45 

Leu Asp Tyr Ile Gln Gly Met Gly Phe Thr Ala Ile Trp Ile Thr Pro
50 55 60 

Val Thr Ala Gln Leu Pro Gln Thr Thr Ala Tyr Gly Asp Ala Tyr His
65 70 75 80 

Gly Tyr Trp Gln Gln Asp Ile Tyr Ser Leu Asn Glu Asn Tyr Gly Thr
85 90 95 

Ala Asp Asp Leu Lys Ala Leu Ser Ser Ala Leu His Glu Arg Gly Met 
100 105 110 

Tyr Leu Met Val Asp Val Val Ala Asn His Met Gly Tyr Asp Gly Ala 
115 120 125 

Gly Ser Ser Val Asp Tyr Ser Val Phe Lys Pro Phe Ser Ser Gln Asp
130 135 140 

Tyr Phe His Pro Phe Cys Phe Ile Gln Asn Tyr Glu Asp Gln Thr Gln
145 150 155 160 

Val Glu Asp Cys Trp Leu Gly Asp Asn Thr Val Ser Leu Pro Asp Leu 
165 170 175 

Asp Thr Thr Lys Asp Val Val Lys Asn Glu Trp Tyr Asp Trp Val Gly 
180 185 190 

Ser Leu Val Ser Asn Tyr Ser Ile Asp Gly Leu Arg Ile Asp Thr Val
195 200 205 

Lys His Val Gln Lys Asp Phe Trp Pro Gly Tyr Asn Lys Ala Ala Gly
210 215 220 

Val Tyr Cys Ile Gly Glu Val Leu Asp Gly Asp Pro Ala Tyr Thr Cys
225 230 235 240 

Pro Tyr Gln Asn Val Met Asp Gly Val Leu Asn Tyr Pro Ile Tyr Tyr
245 250 255 

Pro Leu Leu Asn Ala Phe Lys Ser Thr Ser Gly Ser Met Asp Asp Leu 
260 265 270 

Tyr Asn Met Ile Asn Thr Val Lys Ser Asp Cys Pro Asp Ser Thr Leu
275 280 285

Leu Gly Thr Phe Val Glu Asn His Asp Asn Pro Arg Phe Ala Ser Tyr
290 295 300 



Thr Asn Asp Ile Ala Leu Ala Lys Asn Val Ala Ala Phe Ile Ile Leu
305 310 315 320 

Asn Asp Gly Ile Pro Ile Ile Tyr Ala Gly Gln Glu Gln His Tyr Ala
325 330 335 

Gly Gly Asn Asp Pro Ala Asn Arg Glu Ala Thr Trp Leu Ser Gly Tyr 
340 345 350 

Pro Thr Asp Ser Glu Leu Tyr Lys Leu Ile Ala Ser Ala Asn Ala Ile
355 360 365 

Arg Asn Tyr Ala Ile Ser Lys Asp Thr Gly Phe Val Thr Tyr Lys Asn
370 375 380 

Trp Pro Ile Tyr Lys Asp Asp Ile Thr Ile Ala Met Arg Lys Gly Thr
385 390 395 400 

Asp Gly Ser Gln Ile Val Thr Ile Leu Ser Asn Lys Gly Ala Ser Gly
405 410 415 

Asp Ser Tyr Thr Leu Ser Leu Ser Gly Ala Gly Tyr Thr Ala Gly Gln
420 425 430 

Gln Leu Thr Glu Val Ile Gly Cys Thr Thr Val Thr Val Gly Ser Asp
435 440 445 

Gly Asn Val Pro Val Pro Met Ala Gly Gly Leu Pro Arg Val Leu Tyr 
450 455 460 

Pro Thr Glu Lys Leu Ala Gly Ser Lys Ile Cys Ser Ser Ser
465 470 475 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 1..1455 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CAT CAT AAT GGA ACA AAT GGT ACT ATG ATG CAA TAT TTC GAA TGG TAT 48 
His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
520 525 530 

TTG CCA AAT GAC GGG AAT CAT TGG AAC AGG TTG AGG GAT GAC GCA GCT 96 
Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
535 540 545 

AAC TTA AAG AGT AAA GGG ATA ACA GCT GTA TGG ATC CCA CCT GCA TGG 144 
Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
550 555 560 

AAG GGG ACT TCC CAG AAT GAT GTA GGT TAT GGA GCC TAT GAT TTA TAT 192 
Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
565 570 575

GAT CTT GGA GAG TTT AAC CAG AAG GGG ACG GTT CGT ACA AAA TAT GGA 240
Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
580 585 590 595 

ACA CGC AAC CAG CTA CAG GCT GCG GTG ACC TCT TTA AAA AAT AAC GGC 288
Thr Arg Asn Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
600 605 610 

ATT CAG GTA TAT GGT GAT GTC GTC ATG AAT CAT AAA GGT GGA GCA GAT 336 
Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
615 620 625 

GGT ACG GAA ATT GTA AAT GCG GTA GAA GTG AAT CGG AGC AAC CGA AAC 384 
Gly Thr Glu Ile Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn
630 635 640

CAG GAA ACC TCA GGA GAG TAT GCA ATA GAA GCG TGG ACA AAG TTT GAT 432 
Gln Glu Thr Ser Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp
645 650 655 

TTT CCT GGA AGA GGA AAT AAC CAT TCC AGC TTT AAG TGG CGC TGG TAT 480
Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr
660 665 670 675 

CAT TTT GAT GGG ACA GAT TGG GAT CAG TCA CGC CAG CTT CAA AAC AAA 528 
His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
680 685 690 

ATA TAT AAA TTC AGG GGA ACA GGC AAG GCC TGG GAC TGG GAA GTC GAT 576
Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp
695 700 705 

ACA GAG AAT GGC AAC TAT GAC TAT CTT ATG TAT GCA GAC GTG GAT ATG 624 
Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
710 715 720 

GAT CAC CCA GAA GTA ATA CAT GAA CTT AGA AAC TGG GGA GTG TGG TAT 672
Asp His Pro Glu Val Ile His Glu Leu Arg Asn Trp Gly Val Trp Tyr
725 730 735 

ACG AAT ACA CTG AAC CTT GAT GGA TTT AGA ATA GAT GCA GTG AAA CAT 720 
Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
740 745 750 755 

ATA AAA TAT AGC TTT ACG AGA GAT TGG CTT ACA CAT GTG CGT AAC ACC 768 
Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr
760 765 770 

ACA GGT AAA CCA ATG TTT GCA GTG GCT GAG TTT TGG AAA AAT GAC CTT 816 
Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
775 780 785 

GGT GCA ATT GAA AAC TAT TTG AAT AAA ACA AGT TGG AAT CAC TCG GTG 864 
Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val
790 795 800 

TTT GAT GTT CCT CTC CAC TAT AAT TTG TAC AAT GCA TCT AAT AGC GGT 912 
Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
805 810 815

GGT TAT TAT GAT ATG AGA AAT ATT TTA AAT GGT TCT GTG GTG CAA AAA 960 
Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
820 825 830 835 

CAT CCA ACA CAT GCC GTT ACT TTT GTT GAT AAC CAT GAT TCT CAG CCC 1008
His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
840 845 850 

GGG GAA GCA TTG GAA TCC TTT GTT CAA CAA TGG TTT AAA CCA CTT GCA 1056 
Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
855 860 865 

TAT GCA TTG GTT CTG ACA AGG GAA CAA GGT TAT CCT TCC GTA TTT TAT 1104 
Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
870 875 880 

GGG GAT TAC TAC GGT ATC CCA ACC CAT GGT GTT CCG GCT ATG AAA TCT 1152 
Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
885 890 895 

AAA ATA GAC CCT CTT CTG CAG GCA CGT CAA ACT TTT GCC TAT GGT ACG 1200
Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
900 905 910 915 

CAG CAT GAT TAC TTT GAT CAT CAT GAT ATT ATC GGT TGG ACA AGA GAG 1248 
Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
920 925 930 

GGA AAT AGC TCC CAT CCA AAT TCA GGC CTT GCC ACC ATT ATG TCA GAT 1296
Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
935 940 945 

GGT CCA GGT GGT AAC AAA TGG ATG TAT GTG GGG AAA AAT AAA GCG GGA 1344 
Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly 
950 955 960 

CAA GTT TGG AGA GAT ATT ACC GGA AAT AGG ACA GGC ACC GTC ACA ATT 1392
Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
965 970 975

AAT GCA GAC GGA TGG GGT AAT TTC TCT GTT AAT GGA GGG TCC GTT TCG 1440 
Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
980 985 990 995 

GTT TGG GTG AAG CAA TAA 1458
Val Trp Val Lys Gln
1000 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His His Asn Gly Thr Asn Gly Thr Met Met Gln Tyr Phe Glu Trp Tyr
1 5 10 15

Leu Pro Asn Asp Gly Asn His Trp Asn Arg Leu Arg Asp Asp Ala Ala 
20 25 30 

Asn Leu Lys Ser Lys Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Trp
35 40 45 

Lys Gly Thr Ser Gln Asn Asp Val Gly Tyr Gly Ala Tyr Asp Leu Tyr
50 55 60

Asp Leu Gly Glu Phe Asn Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly
65 70 75 80 

Thr Arg Asn Gln Leu Gln Ala Ala Val Thr Ser Leu Lys Asn Asn Gly
85 90 95 

Ile Gln Val Tyr Gly Asp Val Val Met Asn His Lys Gly Gly Ala Asp
100 105 110 

Gly Thr Glu Ile Val Asn Ala Val Glu Val Asn Arg Ser Asn Arg Asn
115 120 125 

Gln Glu Thr Ser Gly Glu Tyr Ala Ile Glu Ala Trp Thr Lys Phe Asp
130 135 140 

Phe Pro Gly Arg Gly Asn Asn His Ser Ser Phe Lys Trp Arg Trp Tyr 
145 150 155 160 

His Phe Asp Gly Thr Asp Trp Asp Gln Ser Arg Gln Leu Gln Asn Lys
165 170 175 

Ile Tyr Lys Phe Arg Gly Thr Gly Lys Ala Trp Asp Trp Glu Val Asp
180 185 190 

Thr Glu Asn Gly Asn Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Met 
195 200 205 

Asp His Pro Glu Val Ile His Glu Leu Arg Asn Trp Gly Val Trp Tyr
210 215 220 

Thr Asn Thr Leu Asn Leu Asp Gly Phe Arg Ile Asp Ala Val Lys His
225 230 235 240 

Ile Lys Tyr Ser Phe Thr Arg Asp Trp Leu Thr His Val Arg Asn Thr
245 250 255 

Thr Gly Lys Pro Met Phe Ala Val Ala Glu Phe Trp Lys Asn Asp Leu 
260 265 270 

Gly Ala Ile Glu Asn Tyr Leu Asn Lys Thr Ser Trp Asn His Ser Val
275 280 285 

Phe Asp Val Pro Leu His Tyr Asn Leu Tyr Asn Ala Ser Asn Ser Gly 
290 295 300 

Gly Tyr Tyr Asp Met Arg Asn Ile Leu Asn Gly Ser Val Val Gln Lys
305 310 315 320 

His Pro Thr His Ala Val Thr Phe Val Asp Asn His Asp Ser Gln Pro
325 330 335 

Gly Glu Ala Leu Glu Ser Phe Val Gln Gln Trp Phe Lys Pro Leu Ala
340 345 350 

Tyr Ala Leu Val Leu Thr Arg Glu Gln Gly Tyr Pro Ser Val Phe Tyr
355 360 365 

Gly Asp Tyr Tyr Gly Ile Pro Thr His Gly Val Pro Ala Met Lys Ser
370 375 380 

Lys Ile Asp Pro Leu Leu Gln Ala Arg Gln Thr Phe Ala Tyr Gly Thr
385 390 395 400 

Gln His Asp Tyr Phe Asp His His Asp Ile Ile Gly Trp Thr Arg Glu
405 410 415

Gly Asn Ser Ser His Pro Asn Ser Gly Leu Ala Thr Ile Met Ser Asp
420 425 430

Gly Pro Gly Gly Asn Lys Trp Met Tyr Val Gly Lys Asn Lys Ala Gly
435 440 445

Gln Val Trp Arg Asp Ile Thr Gly Asn Arg Thr Gly Thr Val Thr Ile
450 455 460 

Asn Ala Asp Gly Trp Gly Asn Phe Ser Val Asn Gly Gly Ser Val Ser 
465 470 475 480 

Val Trp Val Lys Gln
485 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Val Asn Gly Thr Leu Met Gln Tyr Phe Glu Trp Tyr Thr Pro Asn Asp
1 5 10 15

Gly Gln His Trp Lys Arg Leu Gln Asn Asp Ala Glu His Leu Ser Asp
20 25 30

Ile Gly Ile Thr Ala Val Trp Ile Pro Pro Ala Tyr Lys Gly Leu Ser
35 40 45

Gln Ser Asp Asn Gly Tyr Gly Pro Tyr Asp Leu Tyr Asp Leu Gly Glu
50 55 60

Phe Gln Gln Lys Gly Thr Val Arg Thr Lys Tyr Gly Thr Lys Ser Glu
65 70 75 80

Leu Gln Asp Ala Ile Gly Ser Leu His Ser Arg Asn Val Gln Val Tyr
85 90 95

Gly Asp Val Val Leu Asn His Lys Ala Gly Ala Asp Ala Thr Glu Asp
100 105 110

Val Thr Ala Val Glu Val Asn Pro Ala Asn Arg Asn Gln Glu Thr Ser
115 120 125

Glu Glu Tyr Gln Ile Lys Ala Trp Thr Asp Phe Arg Phe Pro Gly Arg
130 135 140

Gly Asn Thr Tyr Ser Asp Phe Lys Trp His Trp Tyr His Phe Asp Gly
145 150 155 160

Ala Asp Trp Asp Glu Ser Arg Lys Ile Ser Arg Ile Phe Lys Phe Arg
165 170 175

Gly Glu Gly Lys Ala Trp Asp Trp Glu Val Ser Ser Glu Asn Gly Asn
180 185 190

Tyr Asp Tyr Leu Met Tyr Ala Asp Val Asp Tyr Asp His Pro Asp Val
195 200 205

Val Ala Glu Thr Lys Lys Trp Gly Ile Trp Tyr Ala Asn Glu Leu Ser
210 215 220

Leu Asp Gly Phe Arg Ile Asp Ala Ala Lys His Ile Lys Phe Ser Phe
225 230 235 240

Leu Arg Asp Trp Val Gln Ala Val Arg Gln Ala Thr Gly Lys Glu Met
245 250 255

Phe Thr Val Ala Glu Tyr Trp Gln Asn Asn Ala Gly Lys Leu Glu Asn
260 265 270

Tyr Leu Asn Lys Thr Ser Phe Asn Gln Ser Val Phe Asp Val Pro Leu
275 280 285

His Phe Asn Leu Gln Ala Ala Ser Ser Gln Gly Gly Gly Tyr Asp Met
290 295 300

Arg Lys Leu Leu Asn Gly Thr Val Val Ser Lys His Pro Leu Lys Ser
305 310 315 320

Val Thr Phe Val Asp Asn His Asp Thr Gln Pro Gly Gln Ser Leu Glu
325 330 335

Ser Thr Val Gln Thr Trp Phe Lys Pro Leu Ala Tyr Ala Phe Ile Leu
340 345 350

Thr Arg Glu Ser Gly Tyr Pro Gln Val Phe Tyr Gly Asp Met Tyr Gly
355 360 365

Thr Lys Gly Asp Ser Gln Arg Glu Ile Pro Ala Leu Lys His Lys Ile
370 375 380

Glu Pro Ile Leu Lys Ala Arg Lys Gln Tyr Ala Tyr Gly Ala Gln His
385 390 395 400

Asp Tyr Phe Asp His His Asp Ile Val Gly Trp Thr Arg Glu Gly Asp
405 410 415

Ser Ser Val Ala Asn Ser Gly Leu Ala Ala Leu Ile Thr Asp Gly Pro
420 425 430

Gly Gly Ala Lys Arg Met Tyr Val Gly Arg Gln Asn Ala Gly Glu Thr
435 440 445

Trp His Asp Ile Thr Gly Asn Arg Ser Glu Pro Val Val Ile Asn Ser
450 455 460

Glu Gly Trp Gly Glu Phe His Val Asn Gly Gly Ser Val Ser Ile Tyr
465 470 475 480

Val Gln Arg